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FLOWERING TIME MODIFICATION 



The present invention claims priority in part from US Provisional Application Serial 
Nos. 60/159,464 filed October 12, 1999; 60/164,132 filed November 8, 1999; 60/166,228 filed 
5 November 17, 1999; 60/197,899 filed April 17, 2000; and Plant Trait Modification III, filed 

August 22, 2000. 



FIELD OF THE INVENTION 

This invention is in the field of plant molecular biology and relates to compositions and 
10 methods for modifying, a plant's flowering time or vernalization requirements. 

BACKGROUND OF THE INVENTION 

in order to maximize reproductive success, plants have evolved complex mechanisms 
to ensure that flowering occurs under favorable conditions. Analysis of late flowering mutants 

15 and ecotypes in Arabidopsis has revealed that such mechanisms are based upon several 

genetic pathways which may contain 80 or more genes (Martinez-Zapater and Somerville, 
(1990) Plant Physiol. 92:770-776; Koornneef et al. (1991) Moi. Gen. Genet. 229:57-66; EM 
Meyerwitz and CR Somerville Eds (1994) Arabidopsis pp 403-433 Cold Spring Harbor 
Laboratory Press, New York). Together these loci co-ordinate flowering time with 

20 environmental variables (e.g. day-length, temperature, light quality, and nutrient availability) 

and with the developmental stage of the plant. 

Arabidopsis flowers rapidly when grown under long day conditions of 16 hours or 
continuous light, but flowers much later under short day conditions of 8 or 10 hours light. 
Genes regulating this response constitute the photoperiod pathway and were identified by 

25 mutations that cause late flowering under long day conditions but which do not alter flowering 

in short day conditions. Examples from this group, which promote flowering in response to 
long days, include CONSTANS (CO), GIGANTEA (G/), FT, FWA, FE, FD, and FHA. A 
second group of genes, which includes LUMINIDEPENDENS (LD), FCA, FVE, FY, and FPA, 
form an autonomous pathway that is active under all day-length conditions. Mutants for this 

30 second class of genes flower later than wild type controls irrespective of the day length 

conditions (Koornneef et al. (1991) MoL Gen. Genet 229:57-66; EM Meyerwitz and CR 
Somerville Eds (1994) Arabidopsis pp 403-433 Cold Spring Harbor Laboratory Press, New 
York). 

In addition to differing in their response to day-length, mutants from the photoperiod 
35 and autonomous pathways show-a" differential response to prolonged cold (vernalization) 

treatments (Vince-Prue, (1975) Vernalization. In Photoperiodism in Plants pp 263-291, 
McGraw Hill, London) Through a vernalization response, Arabidopsis ecotypes from Northern 
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latitudes, such as Stockholm, are adapted to flower in the spring following exposure to cold 
winter conditions. This avoids flowering in the late summer when seed maturation might be 
curtailed by the onset of winter conditions (Reeves and Coupland, (2000) Curr. Opin. Plant 
Biol 3:37-42). When these ecotypes are grown in the laboratory they flower late f but will flower 
5 much earlier if subjected to a cold period of 4-6 weeks during seed germination. In a 

comparable manner, mutants from the autonomous pathway exhibit a very marked reduction 
in flowering time when subjected to vernalization. In contrast, mutants from the photoperiod 
pathway only show a minor response to cold treatments (Chandler et a/., (1996) Plant J. 
10:637-644; Koornneef et a/., (1998) Genetics 148:885-892). Thus, vernalization can 

10 overcome the requirement for the autonomous pathway conditions (Reeves and Coupland, 

(2000) Curr. Opin. Plant Biol 3:37-42). 

Two Arabidopsis genes, FLOWERING LOCUS C, FLO (also known as FLOWERING 
LOCUS F, FLF) and FRIGIDA (FRI), act in conjunction to repress flowering in the absence of 
a vernalization treatment (Napp-Zinn, K. (1957) Indukt. Abstammungs. Verebungsi 88:253- 

15 285; Napp-Zinn K. (1985) CRC Handbook of Flowering, Vol. 1, A. H. Halevy, pp 492-503; 

Clarke and Dean (1994) MoL Gen. Genet. 248:81-89; Koornneef. et al., (1994) Plant Journal 
6:911-919; Lee et al., (1994) Plant Journal 6:903-909.) Dominant functional alleles of FLC 
and FRI are found together in Northern European Arabidopsis ecotypes such as Pitztal and 
Stockholm. These ecotypes are extremely late flowering when non-vernalized. The widely 

20 used laboratory ecotype Columbia contains functional alleles at only one of these two loci and 

flower slightly later than strains such as Landsberg erecta which possess functional alleles of 
neither gene. The FRIGIDA protein sequence has not yet been published. However, the FLC 
gene has recently been cloned and shown to encode a MADS box protein (Sheldon C. et al., 
1999, Plant Ceil 11:445-458; Michaels S. and Amasino, R., 1999, Plant Cell 11:949-956). 

25 Dominant alleles and overexpression of FLC have been reported to delay flowering, while null 

flc mutants are early flowering (Lee et a/., (1994) Plant J. 6:903-909; Michaels and Amasino, 
(1999) Plant Cell 11:949-956; Sheldon et al., (1999; Proc. Natl. Acad. Sci. 97:3753-3758). 
Thus, FLC acts to prevent premature flowering. 

We have discovered transcription factors that regulate flowering time or vernalization 

30 requirements of plants. These transcription factors could therefore be useful to manipulate 

flowering characteristics of a plant. 

SUMMARY OF THE INVENTION 

35 In one aspect, the present invention relates to a transgenic plant comprising a 

recombinant polynucleotide. The recombinant polynucleotide comprises a nucleotide 
sequence encoding a polypeptide comprising at least 6 consecutive amino acids of a 
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sequence selected from the group consisting of SEQ ID Nos. 2N, where N=1-28 but excluding 
SEQ ID No. 28, and the presence of the recombinant polynucleotide alters the flowering time 
or vernalization requirements of the transgenic plant when compared with the same trait of 
another plant lacking the recombinant polynucleotide. 
5 In one embodiment, the nucleotide sequence encodes a polypeptide comprising a 

conserved domain such as 1) a localization domain, 2) an activation domain, 3) a repression 
domain, 4) an oligomerization domain or 5) a DNA binding domain of SEQ ID Nos. 2N, where 
N=1-28. In another embodiment, the recombinant polynucleotide encodes a polypeptide 
comprising a conserved domain having greater than an 84% sequence identity to a sequence 

10 selected from the group consisting of SEQ ID Nos. 2N, where N=1-28. In a further 

embodiment, the nucleotide sequence further comprises a promoter operably linked to the 
nucleotide sequence. The promoter may be a constitutive or inducible or tissue-active. 

In a second aspect, the present invention relates to a method for altering a plant's 
flowering time or vernalization requirements. The method comprises (a) transforming a plant 

15 with a recombinant polynucleotide comprising a nucleotide sequence encoding a polypeptide 

comprising at least 6 consecutive amino acids of a sequence selected from the group 
consisting of SEQ ID Nos. 2N, where N=1-28; (b) selecting transformed plants; and (c) 
identifying a transformed plant with the desired trait. 

In one embodiment, the nucleotide sequence encodes a polypeptide comprising a 

20 conserved domain such as 1) a localization domain, 2) an activation domain, 3) a repression 

domain, 4) an oligomerization domain or 5) a DNA binding domain domain of SEQ ID Nos. 
2N, where N=1-28 but excluding SEQ ID No. 28. in another embodiment, the recombinant 
polynucleotide encodes a polypeptide comprising a conserved domain having greater than an 
84% sequence identity to a sequence selected from the group consisting of SEQ ID Nos. 2N, 

25 where N=1-28. In a further embodiment, the nucleotide sequence further comprises a 

promoter operably linked to the nucleotide sequence. The promoter may be a constitutive or 
inducible or tissue-active. 

In a third aspect, the present invention relates to another method for altering a plant 
trait associated with flowering time or the plant's vernalization requirements. The method 

30 comprises (a) transforming the plant with a recombinant polynucleotide comprising a 

nucleotide sequence comprising at least 18 consecutive nucleotides of a sequence selected 
from the group consisting of SEQ ID Nos. 2N-1, where N= 1-28 but excluding SEQ ID No. 27; 
and (b) selecting said transformed plant. 

In yet another aspect, the present invention is yet another method for altering a plant's 

35 flowering time or vernalization requirements. The method comprises (a) providing a database 

sequence; (b) comparing the database sequence with a polypeptide selected from SEQ ID 
Nos. 2N, where N= 1-28; (c) selecting a database sequence that meets selected sequence 
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criteria; and (d) transforming said database sequence in the plant. Alternatively, the database 
sequence can be compared with a polynucleotide selected from SEQ ID Nos. 2N-1 f where N= 
1-28. 



BRIEF DESCRIPTION OF THE FIGURES 

Figure 1 provides a table of exemplary polynucleotide and polypeptide sequences of 
10 the invention. The table includes from left to right for each sequence: the SEQ ID No., the 

internal code reference number, whether the sequence is a polynucleotide or polypeptide 
sequence, and identification of any conserved domains for the polypeptide sequences. 

Figure 2 provides a table of sequences that are homologous to the sequences 
provided in the Sequence Listing. The table includes from left to right: the SEQ ID No., the 
15 internal code reference number, the unique Genbank sequence ID No. (NID), the probability 

that the comparison was generated by chance (P-value), and the species from which the 
homologous gene was identified. 

DETAILED DESCRIPTION OF THE INVENTION 

20 DEFINITIONS 

A "recombinant polynucleotide" is a nucleotide sequence comprising a gene coding 
sequence or a fragment thereof (comprising at least 18 consecutive nucleotides, preferably at 
least 30 consecutive nucleotides, and more preferably at least 50 consecutive nucleotides). 

25 Additionally, the polynucleotide may comprise a promoter, an intron, an enhancer region, a 

polyadenylation site, a translation initiation site, 5' or 3' untranslated regions, a reporter gene, 
a selectable marker or the like. The polynucleotide may comprise single stranded or double 
stranded DNA or RNA. The polynucleotide may comprise modified bases or a modified 
backbone. The polynucleotide may be genomic, a transcript (such as an mRNA) or a 

30 processed nucleotide sequence (such as a cDNA). The polynucleotide may comprise a 

sequence in either sense or antisense orientations. 

A "recombinant polynucleotide" is a polynucleotide that is not in its native state, e.g., 
the polynucleotide is comprised of a nucleotide sequence not found in nature or the 
polynucleotide is separated from nucleotide sequences with which it typically is in proximity or 

35 is next to nucleotide sequences with which it typically is not in proximity. 

A "recombinant polypeptide" is a polypeptide derived from the translation of a 
recombinant polynucleotide or is more enriched in a cell than the polypeptide in its natural 
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state in a wild type cell, e.g. more than 5% enriched, more than 10% enriched or more than 
20% enriched and is not the result of a natural response of a wild type plant or is separated 
from other components with which it is typically associated with in a cell. 

A "transgenic plant" may refer to a plant that contains genetic material not normally 
found in a wild type plant of the same species, or in a naturally occurring variety or in a 
cultivar, and which has been introduced into the plant by human manipulation. A transgenic 
plant is a plant that may contain an expression vector or cassette. The expression cassette 
comprises a gene coding sequence and allows for the expression of the gene coding 
sequence. The expression cassette may be introduced into a plant by transformation or by 
breeding after transformation of a parent plant. 

A transgenic plant refers to a whole plant as well as to a plant part, such as seed, fruit, 
leaf, or root, plant tissue, plant cells, protoplasts or any other plant material, and progeny 
thereof. 

The phrase "altered expression" in reference to polynucleotide or polypeptide 
expression refers to an expression pattern in the transgenic plant that is different from the 
expression pattern in the wild type plant or a reference; for example, by expression in a cell 
type other than a cell type in which the sequence is expressed in the wild type plant, or by 
expression at a time other than at the time the sequence is expressed in the wild type plant, or 
by a response to different inducible agents, such as hormones or environmental signals, or at 
different expression levels (either higher or lower) compared with those found in a wild type 
plant. The term also refers to lowering the levels of expression to below the detection level or 
completely abolishing expression. The resulting expression pattern may be transient or 
stable, constitutive or inducible. 

A "transcription factor" (TF) refers to a polynucleotide or polypeptide that controls the 
expression of a gene or genes either directly by binding to one or more nucleotide sequences 
associated with a gene coding sequence or indirectly by affecting the level or activity of other 
polypeptides that do bind directly or indirectly to one or more nucleotide sequences associated 
with a gene coding sequence. A TF, in this definition, includes any polypeptide that can 
activate or repress transcription of a single gene or a number of genes. This polypeptide 
group includes, but is not limited to, DNA binding proteins, protein kinases, protein 
phosphatases, GTP-binding proteins and receptors. 

The transcription factor sequence may comprise a whole coding sequence or a 
fragment or domain of a coding sequence. A "fragment or domain", as referred to 
polypeptides, may be a portion of a polypeptide which performs at least one biological function 
of the intact polypeptide in substantially the same manner or to a similar extent as does the 
intact polypeptide. A fragment may comprise, for example, a DNA binding domain that binds 
to a specific DNA promoter region, an activation domain or a domain for protein-protein 
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interactions. Fragments may vary in size from as few as 6 amino acids to the length of the 
intact polypeptide, but are preferably at least 30 amino acids in length and more preferably at 
least 60 amino acids in length. In reference to a nucleotide sequence u a fragment" refers to 
any sequence of at least consecutive 18 nucleotides, preferably at least 30 nucleotides, more 
preferably at least 50, of any of the sequences provided herein. 

Exemplary polynucleotides and polypeptides comprise a sequence provided in the 
Sequence Listing as SEQ ID No. 1: G157 (cDNA); SEQ ID No. 2: G157 (protein); SEQ ID No. 
3: G859 (cDNA); SEQ ID No. 4: G859 (protein); SEQ ID No. 5: G859.1 (cDNA); SEQ ID No. 6: 
G859.1 (protein); SEQ ID No. 7: G859.2 (cDNA); SEQ ID No. 8: G859.2 (protein); SEQ ID No. 
9: G1842 (cDNA); SEQ ID No. 10: G1842 (protein); SEQ ID No. 11: G1842.2 (cDNA); SEQ ID 
No. 12: G1842.2 (protein); SEQ ID No. 13: G1842.6 (cDNA); SEQ ID No. 14: G1842.6 
(protein); SEQ ID No. 15: G1842.7 (cDNA); SEQ ID No. 16: G1842.7 (protein); SEQ ID No. 
17: G1843 (cDNA); SEQ ID No. 18: G1843 (protein); SEQ ID No. 19: G1844 (cDNA); SEQ ID 
No. 20: G1844 (protein); SEQ ID No. 21: G1844.2 (cDNA); SEQ ID No. 22: G1844.2 (protein); 
SEQ ID No. 23: G861 (cDNA); SEQ ID No. 24: G861 (protein); SEQ ID No. 25: G861.1 
(cDNA); SEQ ID No. 26: G861.1 (protein); SEQ ID No. 27: G1759 (cDNA); SEQ ID No. 28: 
G1759 (protein); SEQ ID No. 29: G192 (cDNA); SEQ ID No. 30: G192 (protein); SEQ ID No. 
31: G234 (cDNA); SEQ ID No. 32: G234 (protein); SEQ ID No. 33: G361 (cDNA); SEQ ID No. 
34: G361 (protein); SEQ ID No. 35: G486 (cDNA); SEQ ID No. 36: G486 (protein); SEQ ID 
No. 37: G748 (cDNA); SEQ ID No. 38: G748 (protein); SEQ ID No. 39: G994 (cDNA); SEQ ID 
No. 40: G994 (protein); SEQ ID No. 41: G1335 (cDNA); SEQ ID No. 42: G1335 (protein); SEQ 
ID No. 43: G562 (cDNA); SEQ ID No. 44: G562 (protein); SEQ ID No. 45: G736 (cDNA); SEQ 
ID No. 46: G736 (protein); SEQ ID No. 47: G1073 (cDNA); SEQ ID No. 48: G1073 (protein); 
SEQ ID No. 49: G1435 (cDNA); SEQ ID No. 50: G1435 (protein); SEQ ID No. 51: G180 
(cDNA); SEQ ID No. 52: G180 (protein); SEQ ID No. 53: G592 (cDNA); SEQ ID No. 54: G592 
(protein); SEQ ID No. 55: G208 (cDNA); and SEQ ID No. 56: G208 (protein). 

A "conserved domain" refers to a polynucleotide or polypeptide fragment that is more 
conserved at a sequence level than other fragments when the polynucleotide or polypeptide is 
compared with homologous genes or proteins from other plants. The conserved domain may 
be 1) a localization domain, 2) an activation domain, 3) a repression domain, 4) a dimerization 
or oligomerization domain, 5) a DNA binding domain or any combination thereof. For MADS 
proteins, the conserved domain is typically a DNA-binding domain. 

A nucleotide sequence is "operably linked" when it is placed into a functional 
relationship with another nucleotide sequence. For example, a promoter or enhancer is 
operably linked to a gene coding sequence if the presence of the promoter or enhancer 
increases the level of expression of the gene coding sequence. 
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"Trait" refers to a physiological, morphological, biochemical or physical characteristic 
of a plant or particular plant material or cell. This characteristic may be visible to the human 
eye, such as seed or plant size, or be measured by biochemical techniques, such as the 
protein, starch or oil content of seed or leaves or by the observation of the expression level of 
5 genes by employing Northerns, RT PGR, microarray gene expression assays or reporter gene 

expression systems or be measured by agricultural observations such as stress tolerance, 
yield or disease resistance. 

"Trait modification" refers to a detectable difference in a characteristic in a transgenic 
plant with modified expression of a polynucleotide or polypeptide of the present invention 

10 relative to a plant not doing so, such as a wild type plant. The trait modification may entail at 

least a 5% increase or decrease in an observed trait (difference), at least a 10% difference, at 
least a 20% difference, at least a 30%, at least a 50%, at least a 70%, at least a 100% or a 
greater difference. It is known that there may be a natural variation in the modified trait. 
Therefore, the trait modification observed entails a change in the normal distribution of the trait 

15 in transgenic plants compared with the distribution observed in wild type plant. 

Trait modifications of particular interest include those to seed (embryo), fruit, root, 
flower, leaf, stem, shoot, seedling or the like, including: enhanced tolerance to environmental 
conditions including freezing, chilling, heat, drought, water saturation, radiation and ozone; 
enhanced resistance to microbial, fungal or viral diseases; resistance to nematodes, 

20 decreased herbicide sensitivity, enhanced tolerance of heavy metals (or enhanced ability to 

take up heavy metals), enhanced growth under poor photpconditions (e.g., low light and/or 
short day length), or changes in expression levels of genes of interest. Other phenotypes that 
may be modified relate to the production of plant metabolites, such as variations in the 
production of taxol, tocopherol, tocotrienol, sterols, phytosterols, vitamins, wax monomers, 

25 anti-oxidants, amino acids, lignins, cellulose, tannins, prenyliipids (such as chlorophylls and 

carotenoids), glucosinolates, and terpenoids, enhanced or compositionally altered protein or 
oil production (especially in seeds), or modified sugar (insoluble or soluble) and/or starch 
composition. Physical plant characteristics that may be modified include cell development 
(such as the number of trichomes), fruit and seed size and number, yields of plant parts such 

30 as stems, leaves and roots, the stability of the seeds during storage, characteristics of the 

seed pod (e.g., susceptibility to shattering), root hair length and quantity, internode distances, 
or the quality of seed coat. Plant growth characteristics that may be modified include growth 
rate, germination rate of seeds, vigor of plants and seedlings, leaf and flower senescence, 
male sterility, apomixis, flowering time, flower abscission, rate of nitrogen uptake, biomass or 

35 transpiration characteristics, as well as plant architecture characteristics such as apical 

dominance, branching patterns, number of organs, organ identity, organ shape or size. 
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Of particular interest are traits relating to modified vernalization requirements or 
flowering time characteristics, such as changes in flowering time in response to day-length, in 
response to temperature, in response to light quality, nutrient availability, and development 
stage of the plant, and the like. 

5 

1. The Sequences 

We have discovered particular plant transcription factors (TFs) that can be employed 
to modify the flowering time of a plant. Therefore, the flowering time of plants can be either 
decreased, increased, or made inducible under specific conditions using the TFs of this 

10 invention. Additionally, the transcription factors can be used to modify the vernalization 

• requirements of the plant. 

The plant transcription factors may belong to one of the following transcription factor 
families: the AP2 (APETALA2) domain transcription factor family (Riechmann and Meyerowitz 
(1998) Biol. Chem. 379:633-646); the MYB transcription factor family (Martin and Paz-Ares, 

15 (1997) Trends Genet. 13:67-73); the MADS domain transcription factor family (Riechmann 

and Meyerowitz (1997) Biol. Chem. 378:1079-1 101); the WRKY protein family (Ishiguro and 
Nakamura (1994) Mol. Gen. Genet. 244:563-571); the ankyrin-repeat protein family (Zhang et 
al. (1992) Plant Cell 4:1 575-1 588); the zinc finger protein (Z) family (Klug and Schwabe (1995) 
FASEB J. 9: 597-604); the homeobox (HB) protein family (Duboule (1994) Guidebook to the 

20 Homeobox Genes, Oxford University Press); the CAAT-element binding proteins (Forsburg 

and Guarente (1989) Genes Dev. 3:1166-1178); the squamosa promoter binding proteins 
(SPB) (Klein et al. (1996) Mol. Gen. Genet. 1996 250:7-16); the NAM protein family (Souer et 
al. (1996) Cell 85:159-170); the IAA/AUX proteins (Rouse et al. (1998) Science 279:1371- 
1373); the HLH/MYC protein family (Littlewood et al. (1994) Prot Profile 1:639-709); the DNA- 

25 binding protein (DBP) family (Tucker et al. (1994) EMBO J. 13:2994-3002); the bZIP family of 

transcription factors (Foster et al. (1994) FASEB J. 8:192-200); the Box P-binding protein (the 
BPF-1) family (da Costa e Silva et al. (1993) Plant J. 4:125-135); the high mobility group 
(HMG) family (Bustin and Reeves (1996J Prog. Nucl. Acids Res. Mol. Biol. 54:35-100); the 
scarecrow (SCR) family (Di Laurenzio et al. (1996) Cell 86:423-433); the GF14 family (Wu et 

30 al. (1997) Plant Physiol. 114:1421-1431); the polycomb (PCOMB) family (Kennison (1995) 

Annu. Rev. Genet. 29:289-303); the teosinte branched (TEO) family (Luo et a), (1996) Nature 
383:794-799; the ABI3 family (Giraudat et al. (1992) Plant Ce//4:1251-1261); the triple helix 
(TH) family (Dehesh et al. (1990) Science 250:1397-1399); the EIL family (Chao et al. (1997) 
Cell 89: 11 33-44); the AT-HOOK family (Reeves and Nissen (1990) Journal of Biological 

35 Chemistry 265:8573-8582); the S1 FA family (Zhou et al. (1995) Nucleic Acids Res. 23:1 165- 

1 169); the bZIPT2 family (Lu and Fert (1995J Plant Physiol. 109:723); the YABBY family 
(Bowman et al. (1999) Development 126:2387-96); the PAZ family (Bohmert et al. (1998) 
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EMBO J. 17:170-80); a family of miscellaneous (MISC) transcription factors including the 
DPBF family (Kim et al. (1997) Plant J. 1 1 :1 237-1251 ) and the SPF1 family (Ishiguro and 
Nakamura (1994) Mol. Gen. Genet 244:563-571); the golden (GLD) family (Hall et al. (1998) 
Plant Cell 10:925-936), and the TUBBY family (Boggin et al. (1999) Science 286:2119-2125) 

In particular, the TFs that we have discovered that are implicated in flowering time or 
vernalization include members of the MADS transcription factor family, the MYB family, the 
WRKY family, the HLH/MYC family, GLD family, AT-HOOK family, the CAAT family, the bZIP 
family, and members of zinc coordinating protein families (Z-Dof, Z-CLDSH and Z-CH2H2). In 
fact we have identified the first members of the WRKY, CAAT, bZIP, AT-HOOK and HLH/MYC 
families that are associated with flowering time modification in plants: G192 and G190 
(WRKY), G486 (CAAT), G562 (bZIP), G1073 (AT-HOOK) and G592 (HLH/MYC). 

The polynucleotides and polypeptides are provided in the Sequence Listing and are 
tabulated in Figure 1. Figure 1 identifies a SEQ ID No., its corresponding GID number, 
whether the sequence is a polynucleotide or a polypeptide sequence, and indicates the 
conserved domains. We have also identified domains or fragments derived from each of the 
sequences in the Sequence Listing. The fragments can be from any region of the sequence, 
can be of any length up to the length of the sequence, and can be as short as six residues for 
protein and 18 nucleotides for DNA. Exemplary fragments of the DNA sequences are as 
follows: 1-50, 51-100, 101-200, 201-218, 218-300, 301-450 and 450-600; and exemplary . 
fragments of proteins are as follows 1-50, 51-100, 101-200, 201-206, 206-250, 251-300. For 
DNA sequences, the numbers may be measured from either 5' or 3' end of the DNA. For the 
protein sequences the fragment location is determined from the N-terminus or C-terminus of 
the protein and may include adjacent amino acid sequences, such as for example for SEQ ID 
No. 2 an additional 10, 20, 40, 60 or 100 amino acids in either N-terminal or C-terminal 
direction of the described fragments. 

The identified polypeptide fragments may be linked to fragments or sequences 
derived from other transcription factors so as to generate additional novel sequences, such as 
by employing the methods described in Short, PCT publication WO9827230, entitled "Methods 
and Compositions for Polypeptide Engineering" or in Patten et al., PCT publication 
W09923236, entitled "Method of DNA Shuffling" or in Minshull and Stemmer, US Patent No. 
5,837,458. Alternatively, the identified fragment may be linked to a transcription activation 
domain. A transcription activation domain assists in initiating transcription from a DNA binding 
site. A common feature of some activation domains is that they are designed to form 
amphiphilic alpha helices with excess positive or negative charge (Giniger and Ptashne (1987) 
Nature 330:670-672, Gill and Ptashne (1987) Cell 51:121-126, Estruch et al (1994) Nucl. 
Acids Res. 22:3983-3989). Examples include the transcription activation region of VP16 or 
GAL4( Moore etal. (1998) Proc. Natl. Acad. ScL USA 95: 376-381; and Aoyama et al. 
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(1995) Plant Cell 7:1773-1785), peptides derived from bacterial sequences (Ma and Ptashne 
(1987) Cell 51; 113-119) and synthetic peptides (Giniger and Ptashne, supra). 

The isolated polynucleotides and polypeptides may be used to modify plant 
development, physiology or biochemistry such that the modified plants have a trait advantage 
over wild type plants. The identified polynucleotide fragments are also useful as nucleic acid 
probes and primers. A nucleic acid probe is useful in hybridization protocols, including 
protocols for microarray experiments. Primers may be annealed to a complementary target 
DNA strand by nucleic acid hybridization to form a hybrid between the primer and the target 
DNA strand, and then extended along the target DNA strand by a DNA polymerase enzyme. 
Primer pairs can be used for amplification of a nucleic acid sequence, e.g., by the polymerase 
chain reaction (PCR) or other nucleic-acid amplification methods. See Sambrook et aL, 
Molecular Cloning. A Laboratory Manual, Ed. 2, Cold Spring Harbor Laboratory Press, New 
York (1989) and Ausubel et a!, (eds) Current Protocols in Molecular Biology, John Wiley & 
Sons (1998). 

2. Identification of Homologous Sequences (Homologs) 

Homologous sequences to those provided in the Sequence Listing derived from 
Arabidopsis thaliana or from other plants may be used to modify a plant trait. Homologous 
sequences may be derived from any plant including monocots and dicots and in particular 
agriculturally important plant species, including but not limited to, crops such as soybean, 
wheat, corn, potato, cotton, rice, oilseed rape (including canola), sunflower, alfalfa, sugarcane 
and turf; or fruits and vegetables, such as banana, blackberry, blueberry, strawberry, and 
raspberry, cantaloupe, carrot, cauliflower, coffee, cucumber, eggplant, grapes, honeydew, 
lettuce, mango, melon, onion, papaya, peas, peppers, pineapple, spinach, squash, sweet 
corn, tobacco, tomato, watermelon, rosaceous fruits (such as apple, peach, pear, cherry and 
plum) and vegetable brassicas (such as broccoli, cabbage, cauliflower, brussel sprouts and 
kohlrabi). Other crops, fruits and vegetables whose phenotype may be changed include 
barley, currant, avocado, citrus fruits such as oranges, lemons, grapefruit and tangerines, 
artichoke, cherries, nuts such as the walnut and peanut, endive, leek, roots, such as 
arrowroot, beet, cassava, turnip, radish, yam, sweet potato and beans. The homologs may 
also be derived from woody species, such pine, poplar and eucalyptus. 

Substitutions, deletions and insertions introduced into the sequences provided in the 
Sequence Listing are also envisioned by the invention. Such sequence modifications can be 
engineered into a sequence by site-directed mutagenesis (Wu (ed.) Meth. Enzymoi (1993) 
vol. 217, Academic Press). Amino acid substitutions are typically of single residues; 
insertions usually will be on the order of about from 1 to 10 amino acid residues; and deletions 
will range about from 1 to 30 residues. In preferred embodiments, deletions or insertions are 
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made in adjacent pairs, e.g., a deletion of two residues or insertion of two residues. 
Substitutions, deletions, insertions or any combination thereof may be combined to arrive at a 
sequence. The mutations that are made in the polynucleotide encoding the transcription factor 
should not place the sequence out of reading frame and should not create complementary 
regions that could produce secondary mRNA structure. 

Substitutions are those in which at least one residue in the amino acid sequence has 
been removed and a different residue inserted in its place. Such substitutions may be 
conservative with little effect on the function of the gene, for example by substituting alanines 
for serines, arginines for lysines, glutamate for aspartate and the like. The substitutions which 
are not conservative are expected to produce the greatest changes in protein properties will 
be those in which (a) a hydrophilic residue, e.g., seryl or threonyl, is substituted for (or by) a 
hydrophobic residue, e.g., leucyl, isoleucyl, phenylalanyl, valyl or alanyl; (b) a cysteine or 
proline is substituted for (or by) any other residue; (c) a residue having an electropositive side 
chain, e.g., lysyl, arginyl, or histidyl, is substituted for (or by) an electronegative residue, e.g., 
glutamyl or aspartyl; or (d) a residue having a bulky side chain, e.g., phenylalanine, is 
substituted for (or by) one not having a side chain, e.g., glycine. 

Additionally, the term "homologous sequence" may encompass a polypeptide 
sequence that is modified by chemical or enzymatic means. The homologous sequence may 
be a sequence modified by lipids, sugars, peptides, organic or inorganic compounds, by the 
use of modified amino acids or the like. Protein modification techniques are illustrated in 
Ausube! et al. (eds) Current Protocols in Molecular Biology, John Wiley & Sons (1998). 

Homologous sequences also may mean two sequences having a substantial 
percentage of sequence identity after alignment as determined by using sequence analysis 
programs for database searching and sequence alignment and comparison available, for 
example, from the Wisconsin Package Version 10.0, such as BLAST, FASTA, PILEUP, 
FINDPATTERNS or the like (GCG, Madision, Wl). Public sequence databases such as 
GenBank, EMBL, Swiss-Prot and PIR or private sequence databases such as PhytoSeq 
(Incyte Pharmaceuticals, Palo Alto, CA) may be searched. Alignment of sequences for 
comparison may be conducted by the local homology algorithm of Smith and Waterman 
(1981) Adv. Appl. Math. 2:482, by the homology alignment algorithm of Needleman and 
Wunsch (1970) J. Mol. Biol. 48:443, by the search for similarity method of Pearson and 
Lipman (1988) Proc. Natl. Acad. Set. U.S.A. 85: 2444, by computerized implementations of 
these algorithms. After alignment, sequence comparisons between two (or more) 
polynucleotides or polypeptides are typically performed by comparing sequences of the two 
sequences over a comparison window to identify and compare local regions of sequence 
similarity. The comparison window may be a segment of at least about 20 contiguous 
positions, usually about 50 to about 200, more usually about 100 to about 150 contiguous 
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positions. A description of the method is provided in Ausubel et al. (eds) (1999) Current 
Protocols in Molecular Biology, John Wiley & Sons. 

Transcription factors that are homologs of the disclosed sequences will typically share 
at least 40% amino acid sequence identity. More closely related TFs may share at least 50%, 
60%, 65%, 70%, 75% or 80% sequence identity with the disclosed sequences. Factors that 
are most closely related to the disclosed sequences share at least 85%, 90% or 95% 
sequence identity. At the nucleotide level, the sequences will typically share at least 40% 
nucleotide sequence identity, preferably at least 50%, 60%, 70% or 80% sequence identity, 
and more preferably 85%, 90%, 95% or 97% sequence identity. The degeneracy of the 
genetic code enables major variations in the nucleotide sequence of a polynucleotide while 
maintaining the amino acid sequence of the encoded protein. 

One way to identify whether two nucleic acid molecules are closely related is that the 
two molecules hybridize to each other under stringent conditions. Generally, stringent conditions 
are selected to be about 5°C to 20°C lower than the thermal melting point (Tm) for the specific 
sequence at a defined ionic strength and pH. The T m is the temperature (under defined ionic 
strength and pH) at which 50% of the target sequence hybridizes to a perfectly matched probe. 
Conditions for nucleic acid hybridization and calculation of stringencies can be found in 
Sambrook et al. (1989) Molecular Cloning. A Laboratory Manual, Ed. 2, Cold Spring Harbor 
Laboratory Press, New York and Tijssen (1993) Laboratory Techniques in Biochemistry and 
Molecular Biology-Hybridization with Nucleic Acid Probes Part I, Elsevier, New York . Nucleic 
acid molecules that hybridize under stringent conditions will typically hybridize to a probe based 
on either the entire cDNA or selected portions of the cDNA under wash conditions of 0.2x SSC 
to 2.0 x SSC, 0.1 % SDS at 50-65° C, for example 0.2 x SSC, 0.1 % SDS at 65° C. For detecting 
less closely related homologs washes may be performed at 50° C. 

For conventional hybridization the hybridization probe is conjugated with a detectable 
label such as a radioactive label, and the probe is preferably of at least 20 nucleotides in 
length. As is well known in the art, increasing the length of hybridization probes tends to give 
enhanced specificity. The labeled probe derived from the Arabidopsis nucleotide sequence 
may be hybridized to a plant cDNA or genomic library and the hybridization signal detected 
using means known in the art. The hybridizing colony or plaque (depending on the type of 
library used) is then purified and the cloned sequence contained in that colony or plaque 
isolated and characterized. Homologs may also be identified by PCR-based techniques, such 
as inverse PCR or RACE, using degenerate primers. See Ausubel et al. (eds) (1998) Current 
Protocols in Molecular Biology, John Wiley & Sons. 

TF homologs may alternatively be obtained by immunoscreening an expression library. 
With the provision herein of the disclosed TF nucleic acid sequences, the polypeptide may be 
expressed and purified in a heterologous expression system (e.g., E. colt) and used to raise 
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antibodies (monoclonal or polyclonal) specific for the TF. Antibodies may also be 'raised against 
synthetic peptides derived from TF amino acid sequences. Methods of raising antibodies are 
well known in the art and are described in Harlow and Lane (1988) Antibodies: A Laboratory 
Manual, Cold Spring Harbor Laboratory, New York. Such antibodies can then be used to screen 
5 an expression library produced from the plant from which it is desired to clone the TF homolog, 

using the methods described above. The selected cDNAs may be confirmed by sequencing and 
biological activity. 

3. Altered Expression of Transcription Factors 

10 Any of the identified sequences may be incorporated into a cassette or vector for 

expression in plants. A number of expression vectors suitable for stable transformation of plant 
cells or for the establishment of transgenic plants have been described including those 
described in Weissbach and Weissbach, (1989,) Methods for Plant Molecular Biology, 
Academic Press, and Gelvin et al., (1990) Plant Molecular Biology Manual, Kluwer Academic 

15 Publishers. Specific examples include those derived from a Ti plasmid of Agrobacterium 

tumefaciens, as well as those disclosed by Herrera-Estrella, L., et al. t (1983) Nature 303: 209, 
Bevan, M., Nucl. Acids Res. (1984) 12: 8711-8721, Klee, H. J., (1985) Bio/Technology 3: 637- 
642, for dicotyledonous plants. Ti-derived plasmids can be transferred into both 
monocotonous and docotyledonous species using Agrobacterium-mediated transformation 

20 (Ishida et al (1996) Nat. Biotechnol. 14:745-50; Barton et al. (1983) Cell 32:1033-1043). 

Alternatively, non-Ti vectors can be used to transfer the DNA into plants and cells by 
using free DNA delivery techniques. Such methods may involve, for example, the use of 
liposomes, electroporation, microprojectile bombardment, silicon carbide wiskers, and viruses. 
By using these methods transgenic plants such as wheat, rice (Christou, P., (1991) 

25 Bio/Technology 9: 957-962) and corn (Gordon-Kamm, W., (1990) Plant Cell 2: 603-618) can 

be produced. An immature embryo can also be a good target tissue for monocots for direct 
DNA delivery techniques by using the particle gun (Weeks, T. et al., (1993) Plant Physiol. 102: 
1077-1084; Vasil, V M (1993) Bio/Technology 10: 667-674; Wan, Y. and Lemeaux, P., (1994) 
Plant Physiol. 104: 37-48, and for Agrobacterium-me6\ated DNA transfer (Ishida et al., (1996) 

30 Nature Biotech. 14: 745-750). 

Typically, plant transformation vectors include one or more cloned plant coding 
sequences (genomic or cDNA) under the transcriptional control of 5* and 3' regulatory 
sequences and a dominant selectable marker. Such plant transformation vectors typically 
also contain a promoter (e.g., a regulatory region controlling inducible or constitutive, 

35 environmentally-or developmentally-regulated, or cell- or tissue-specific expression), a 

transcription initiation start site, an RNA processing signal (such as intron splice sites), a 
transcription termination site, and/or a polyadenylation signal. 
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Examples of constitutive plant promoters which may be useful for expressing the TF 
sequence include: the cauliflower mosaic virus (CaMV) 35S promoter, which confers 
constitutive, high-level expression in most plant tissues (see, e.g., Odel et al., (1985) Nature 
313:810); the nopaline synthase promoter (An et al., (1988) Plant Physiol. 88:547); and the 
octopine synthase promoter (Fromm et al., (1989) Plant Cell 1: 977). 

A variety of plant gene promoters that regulate gene expression in response to 
environmental, hormonal, chemical, developmental signals, and in a tissue-active manner can 
be used for expression of the TFs in plants, as illustrated by seed-specific promoters (such as 
the napin, phaseolin or DC3 promoter described in US Pat. No. 5,773,697), root-specific 
promoters, such as those disclosed in US Patent Nos. 5,618,988, 5,837,848 and 5,905,186; 
fruit-specific promoters that are active during fruit ripening (such as the dru 1 promoter (US 
Pat. No. 5,783,393), or the 2A1 1 promoter (US Pat. No. 4,943,674) and the tomato 
polygalacturonase promoter (Bird et al. (1 988) Plant Mol. Biol. 1 1 :651 ), root-specific 
promoters, such as those disclosed in US Patent Nos. 5,618,988, 5,837,848 and 5,905,186, 
pollen-active promoters such as PTA29, PTA26 and PTA13 (US Pat. No. 5,792,929), 
promoters active in vascular tissue (Ringli and Keller (1998) Plant Mol. Biol. 37:977-988), 
flower-specific (Kaiser et al, (1995) Plant Mol. Biol. 28:231-243), pollen (Baerson et al. (1994) 
Plant Mol. Biol. 26:1947-1959), carpels (Ohl et al. (1990) Plant Cell 2:837-848), pollen and 
ovules (Baerson et al. (1993) Plant Mol. Biol. 22:255-267), auxin-inducible promoters (such as 
that described in van der Kop et al (1999) Plant Mol. Biol. 39:979-990 or Baumann et al. 
(1999) Plant Cell 1 1 :323-334), cytokinin-inducibie promoter (Guevara-Garcia (1998) Plant Mot. 
Biol. 38:743-753), promoters responsive to gibberellin (Shi et al. (1998) Plant Mol. Biol, 
38:1053-1060, Willmott et al. (1998) 38:817-825) and the like. Additional promoters are those 
that elicit expression in response to heat (Ainley, et al. (1993) Plant Mol. Biol. 22: 13-23), light 
(e.g., the pea rbcS-3A promoter, Kuhlemeier et al., (1989) Plant Cell 1:471, and the maize 
rbcS promoter, Schaffner and Sheen, (1991) Plant Ce//3: 997); wounding (e.g., wunl, Siebertz 
et al., (1989) Plant Cell 1: 961); pathogen resistance, and chemicals such as methyl 
jasmonate or salicylic acid.(Gatz et al., (1997) Plant Mol. Biol. 48: 89-108). In addition, the 
timing of the expression can be controlled by using promoters such as those acting at late seed 
development (Odell et al. (1994) Plant Physiol. 106:447-458). 

Plant expression vectors may also include RNA processing signals that may be 
positioned within, upstream or downstream of the coding sequence. In addition, the 
expression vectors may include additional regulatory sequences from the 3-untranslated 
region of plant genes, e.g., a 3* terminator region to increase mRNA stability of the mRNA, 
such as the Pl-ll terminator region of potato or the octopine or nopaline synthase 3* terminator 
regions. 
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Finally, as noted above, plant expression vectors may also include dominant 
selectable marker genes to allow for the ready selection of transformants. Such genes include 
those encoding antibiotic resistance genes (e.g., resistance to hygromycin, kanamycin, 
bleomycin, G418, streptomycin or spectinomycin) and herbicide resistance genes (e.g., 
phosphinothricin acetyltransferase). 

A reduction of TF expression in a transgenic plant to modifiy a plant trait may be 
obtained by introducing into plants antisense constructs based on the TF cDNA. For 
antisense suppression, the TF cDNA is arranged in reverse orientation relative to the promoter 
sequence in the expression vector. The introduced sequence need not be the full length TF 
cDNA or gene, and need not be identical to the TF cDNA or a gene found in the plant type to 
be transformed. Generally, however, where the introduced sequence is of shorter length, a 
higher degree of homology to the native TF sequence will be needed for effective antisense 
suppression. Preferably, the introduced antisense sequence in the vector will be at least 30 
nucleotides in length, and improved antisense suppression will typically be observed as the 
length of the antisense sequence increases. Preferably, the length of the antisense sequence 
in the vector will be greater than 100 nucleotides. Transcription of an antisense construct as 
described results in the production of RNA molecules that are the reverse complement of 
mRNA molecules transcribed from the endogenous TF gene in the plant cell. Suppression of 
endogenous TF gene expression can also be achieved using a ribozyme. Ribozymes are 
synthetic RNA molecules that possess highly specific endoribonuclease activity. The 
production and use of ribozymes are disclosed in U.S. Patent No. 4,987,071 to Cech and U.S. 
Patent No. 5,543,508 to Haselhoff. The inclusion of ribozyme sequences within antisense 
RNAs may be used to confer RNA cleaving activity on the antisense RNA, such that 
endogenous mRNA molecules that bind to the antisense RNA are cleaved, which in turn leads 
to an enhanced antisense inhibition of endogenous gene expression. 

Vectors in which RNA encoded by the TF cDNA (or variants thereof) is over- 
expressed may also be used to obtain co-suppression of the endogenous TF gene in the 
manner described in U.S. Patent No. 5,231,020 to Jorgensen. Such co-suppression (also 
termed sense suppression) does not require that the entire TF cDNA be introduced into the 
plant cells, nor does it require that the introduced sequence be exactly identical to the 
endogenous TF gene. However, as with antisense suppression, the suppressive efficiency will 
be enhanced as (1) the introduced sequence is lengthened and (2) the sequence similarity 
between the introduced sequence and the endogenous TF gene is increased. 

Vectors expressing an untranslatable form of the TF mRNA may also be used to 
suppress the expression of endogenous TF activity to modify a trait. Methods for producing such 
constructs are described in U.S. Patent No. 5,583,021 to Dougherty et al. Preferably, such 
constructs are made by introducing a premature stop codon into the TF gene. Alternatively, a 
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plant trait may be modified by gene silencing using double-strand RNA (Sharp (1999) Genes and 
Development 13: 139-141). This approach, whereby a vector is prepared in which a cDNA or 
gene is arranged in duplicated fashion and is capable of generating upon expression a double 
stranded RNA molecule with a hairpin structure. This procedure has been used to modify gene 
activity in plants (Chuang and Meyerowitz (1999) Proc. Natl. Acad. Sci. 97:4985-9490). 

Another method for abolishing the expression of a gene is by insertion mutagenesis 
using the T-DNA of Agrobacterium tumefaciens. After generating the insertion mutants, the 
mutants can be screened to identify those containing the insertion in a TF gene. Mutants 
containing a single mutation event at the desired gene may be crossed to generate homozygous 
plants for the mutation (Koncz et al. (1992) Methods in Arabidopsis Research. World Scientific). 

A plant trait may also be modified by using the cre-lox system (for example, as described 
in US Pat. No. 5,658,772). A plant genome may be modified to include first and second iox sites 
that are then contacted with a Cre recombinase. If the Iox sites are in the same orientation, the 
intervening DNA sequence between the two sites is excised. If the Iox sites are in the opposite 
orientation, the intervening sequence is inverted. 

The polynucleotides and polypeptides of this invention may also be expressed in a plant 
in the absence of an expression cassette by manipulating the activity or expression level of the 
endogenous gene by other means. For example, by ectopically expressing a gene by T-DNA 
activation tagging (Ichikawa et al., (1997) Nature 390 698-701, Kakimoto et aL, (1996) 
Science 274: 982-985). This method entails transforming a plant with a gene tag containing 
multiple transcriptional enhancers and once the tag has inserted into the genome, expression 
of a flanking gene coding sequence becomes deregulated. In another example, the 
transcriptional machinery in a plant may be modified so as to increase transcription levels of a 
polynucleotide of the invention (See PCT Publications WO9606166 and WO 9853057 which 
describe the modification of the DNA binding specificity of zinc finger proteins by changing 
particular amino acids in the DNA binding motif). 

The transgenic plant may also comprise the machinery necessary for expressing or 
altering the activity of a polypeptide encoded by an endogenous gene, for example by altering 
the phosphorylation state of the polypeptide to maintain it in an activated state. 

4. Transgenic Plants with Modified TF Expression 

Once an expression cassette comprising a polynucleotide encoding a TF gene of this 
invention has been constructed, standard techniques may be used to introduce the 
polynucleotide into a plant in order to modify a trait of the plant. The plant may be any higher 
plant, including gymnosperms, monocotyledonous and dicotyledenous plants. Suitable 
protocols are available for Leguminosae (alfalfa, soybean, clover, etc.), Umbeltiferae (carrot, 
celery, parsnip), Cruciferae (cabbage, radish, rapeseed, broccoli, etc.), Curcurbitaceae 
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(melons and cucumber), Gramineae (wheat, corn, rice, barley, millet, etc.), Solanaceae 
(potato, tomato, tobacco, peppers, etc.), and various other crops. See protocols described in 
Ammirato et al. (1984) Handbook of Plant Cell Culture -Crop Species. Macmilian Publ. Co. 
Shimamoto et al. (1989) Nature 338:274-276; Fromm et al. (1990) Bio/Technology 8:833- 
5 839; and Vasil et al. (1990) Bio/Technology 8:429-434. 

Transformation and regeneration of both monocotyledonous and dicotyledonous plant 
cells is now routine, and the selection of the most appropriate transformation technique will be 
determined by the practitioner. The choice of method will vary with the type of plant to be 
transformed; those skilled in the art will recognize the suitability of particular methods for given 

10 plant types. Suitable methods may include, but are not limited to: electroporation of plant 

protoplasts; liposome-mediated transformation; polyethylene glycol (PEG) mediated 
transformation; transformation using viruses; micro-injection of plant cells; micro-projectile 
bombardment of plant cells; vacuum infiltration; and Agrobacterium tumeficiens mediated 
transformation. Transformation means introducing a nucleotide sequence in a plant in a 

15 manner to cause stable or transient expression of the sequence. 

Successful examples of the modification of plant characteristics by transformation with 
cloned sequences which serve to illustrate the current knowledge in this field of technology, 
and which are herein incorporated by reference, include: U.S. Patent Nos. 5,571,706; 
5,677,175; 5,510,471; 5,750,386; 5,597,945; 5,589,615; 5,750,871; 5,268,526; 5,780,708; 

20 5,538,880; 5,773,269; 5,736,369 and 5,610,042. 

Following transformation, plants are preferably selected using a dominant selectable 
marker incorporated into the transformation vector. Typically, such a marker will confer 
antibiotic or herbicide resistance on the transformed plants, and selection of transformants can 
be accomplished by exposing the plants to appropriate concentrations of the antibiotic or 

25 herbicide. 

After transformed plants are selected and grown to maturity, those plants showing a 
modified trait are identified. The modifed trait may be any of those traits described above. 
Additionally, to confirm that the modified trait is due to changes in expression levels or activity 
of the polypeptide or polynucleotide of the invention may be determined by analyzing mRNA 
30 expression using Northern blots, RT-PCR or microarrays, or protein expression using 

immunoblots or Western blots or gel sh\fi assays. 

5. Commercial Applications of the Polynucleotides and Polypeptides 

35 Specific applications for the genes of the present invention relate to their potential 

roles in plant flowering time or the vernalization response. Most modern crop varieties are the 
result of extensive breeding programs and many generations of backcrossing may be required 
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to introduce desired traits. Systems that accelerate flowering could have valuable applications 
in such programs since they allow much faster generation times. Additionally, in some 
instances, a faster generation time might allow additional harvests of a crop to be made within 
a given growing season. With the advent of transformation systems for tree species such as 
oil palm, aspen, pine and eucalyptus, forest biotechnology is a growing area of interest. 

Also, in species such as sugarbeet where the vegetative parts of the plants constitute 
the crop and the reproductive tissues are discarded, it would be advantageous to delay or 
prevent flowering. Extending vegetative development could bring about large increases in 
yields. 

Furthermore, by regulating the expression of flowering-time controlling genes, using 
inducible promoters, flowering could potentially be triggered as desired (for example, by 
application of a chemical inducer). This would allow, for example, flowering to be synchronized 
across a crop and facilitate more efficient harvesting. Such inducible systems could be used to 
tune the flowering of crop varieties to different latitudes. At present, species such as soybean 
and cotton are available as a series of maturity groups that are suitable for different latitudes 
on the basis of their flowering time (which is governed by day-length). A system in which 
flowering could be chemically controlled would allow a single high-yielding northern maturity 
group to be grown at any latitude. In southern regions such plants could be grown for longer, 
thereby increasing yields, before flowering was induced. In more northern areas, the induction 
would be used to ensure that the crop flowers prior to the first winter frosts. Currently, the 
existence of a series of maturity groups for different latitudes represents a major barrier to the 
introduction of new valuable traits. 

For many crop species, high yielding winter-varieties can only be grown in temperate 
regions where the winter season is prolonged and cold enough to elicit a vernalization 
response. Altered expression of the genes of the invention could compensate for a 
vernalization treatment in late-flowering Arabidopsis ecotypes. Similar effects might be 
achieved in crop plants. Winter varieties of wheat, for instance, which over-express G157 (or 
the wheat ortholog) might then be grown in areas like Southern California which would 
otherwise be too warm to allow effective vernalization. A second application for this system is 
in cherry {Pmnus). Locally grown cherries are unavailable in the early Californian spring since 
the winters are too warm for vernalization to occur. 

A further application exists in strawberry (Fragaria). Strawberry has a well-defined 
perennial cycle of flower initiation, dormancy, chilling, crop growth and runner production. In 
temperate European countries, the plants flower in early spring, and fruit is produced in May 
or June. Following fruiting, runners are generated that carry plantlets which take root. The 
plants then remain dormant all through the late summer and autumn. Flowering cannot be 
repeated until the following spring after the plants have received a winter cold treatment. A 
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system, which bypasses this vernalization requirement, might permit a second autumn crop of 
strawberries to be harvested in addition to the spring crop. 

Finally, in addition to the direct applications of the genes themselves, their regulatory 
regions could also be of value. If the promoters of these genes are responsive to low 
temperatures they could be incorporated into expression systems for regulation of genes that 
confer tolerance to freezing. Such genes would then be up regulated specifically at the time 
required, thereby minimizing any toxic effects that result from their constitutive expression. 

6. Other Utility of the Polypeptide and Polynucleotides 

A transcription factor coding provided by the present invention may also be used to 
identify exogenous or endogenous molecules that may affect expression of the transcription 
* factors and may affect flowering time. These molecules may include organic or inorganic 
compounds. 

For example, the method may entail first placing the molecule in contact with a plant 
or plant cell. The molecule may be introduced by topical administration, such as spraying or 
soaking of a plant, and then the molecule's effect on the expression or activity of the TF 
polypeptide or the expression of the polynucleotide monitored. Changes in the expression of 
the TF polypeptide may be monitored by use of polyclonal or monoclonal antibodies, gel 
electrophoresis or the like. Changes in the expression of the corresponding polynucleotide 
sequence may be detected by use of microarrays, Northerns or any other technique for 
monitoring changes in mRNA expression. These techniques are exemplified in Ausubel et al. 
(eds) Current Protocols in Molecular Biology, John Wiley & Sons (1998). Such changes in the 
expression levels may be correlated with modified plant traits and thus identified molecules 
may be useful for soaking or spraying on fruit, vegetable and grain crops to modify traits in 
plants. 

The transcription factors may also be employed to identify promoter sequences with 
which they may interact. After identifying a promoter sequence, interactions between the 
transcription factor and the promoter sequence may be modified by changing specific 
nucleotides in the promoter sequence or specific amino acids in the transcription factor that 
interact with the promoter sequence to alter a plant trait. Typically, transcription factor DNA 
binding sites are identified by gel shift assays. After identifying the promoter regions, the 
promoter region sequences may be employed in double-stranded DNA arrays to identify 
molecules that affect the interactions of the TFs with their promoters (Bulyk et al. (1999) 
Nature Biotechnology 17:573-577). 

The identified transcription factors are also useful to identify proteins that modify the 
activity of the transcription factor. Such modification may occur by covalent modification, such 
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as by phosphorylation, or by protein-protein (homo or-heteropolymer) interactions. Any 
method suitable for detecting protein-protein interactions may be employed. Among the 
methods that may be employed are co-immunoprecipitation, cross-linking and co-purification 
through gradients or chromatographic columns, and the two-hybrid yeast system. 
5 The two-hybrid system detects protein interactions in vivo and is described in Chien, 

et al., (1991), Proc. Natl Acad. Sci. USA, 88, 9578-9582 and is commercially available from 
Clontech (Palo Alto, Calif.). In such a system, plasmids are constructed that encode two 
hybrid proteins: one consists of the DNA-binding domain of a transcription activator protein 
fused to the TF polypeptide and the other consists of the transcription activator protein's 

10 activation domain fused to an unknown protein that is encoded by a cDNA that has been 

recombined into the plasmid as part of a cDNA library. The DNA-binding domain fusion 
plasm id and the cDNA library are transformed into a strain of the yeast Saccharomyces 
cerevisiae that contains a reporter gene (e.g., lacZ) whose regulatory region contains the 
transcription activator's binding site. Either hybrid protein alone cannot activate transcription of 

15 the reporter gene. Interaction of the two hybrid proteins reconstitutes the functional activator 

protein and results in expression of the reporter gene, which is detected by an assay for the 
reporter gene product. Then, the library plasmids responsible for reporter gene expression are 
isolated and sequenced to identify the proteins encoded by the library plasmids. After 
identifying proteins that interact with the transcription factors, assays for compounds that 

20 interfere with the TF protein-protein interactions may be preformed. 

The following examples are intended to illustrate but not limit the present invention. 

EXAMPLES 

Methods 

25 All experiments were performed using Arabidopsis of ecotype Columbia except where 

otherwise indicated. The Stockholm (CS6863) and Pitztal (CS6832) lines were supplied by 
the ABRC at Ohio State University. In all experiments, seeds were sterilized by a 2 minute 
ethanol treatment followed by 30 minutes in 30% bleach / 0.01% Tween and five washes in 
distilled water. Seeds were sown to MS agar in 0.1% agarose and stratified for 3-5 days at 4 

30 °C, before transfer to growth rooms with a temperature of 20-25 °C. MS media was 

supplemented with 50mg/l kanamycin for selection of transformed plants. Plants were 
transplanted to soil after 7 days of growth on plates. For vernalization treatments, seeds were 
sown to MS agar plates, sealed with micropore tape, and placed in a 4°C cold room with low 
light levels for 6-8 weeks. The plates were then transferred to the growth rooms alongside 

35 plates containing freshly sown non-vernalized controls. Whole vegetative seedlings were 

harvested for gene expression analysis at 6 to 9 days after transfer. Rosette leaves were 
counted when a visible inflorescence of approximately 3 cm was apparent. Rosette and total 
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leaf number on the progeny stem are tightly correlated with the timing of flowering (Koornneef 
etal (1991) Mo/. Gen. Genet 229:57-66. 

Example I. Full Length Gene Identification and Cloning 

For the following examples, G157 refers to SEQ ID Nos 1 and 2, G859 refers to SEQ 
ID Nos. 3-8, G1842 refers to SEQ ID Nos. 9-16, G1843 refers to SEQ ID Nos. 17 and 18, 
G1844 refers to SEQ ID Nos. 19-22, G861 refers to SEQ ID Nos. 23-26 and FLC or G1759 
refers to SEQ ID Nos. 27, 28. 

Putative transcription factor sequences (genomic or ESTs) related to known 
transcription factors were identified in the Arabidopsis thaliana GenBank database using the 
tbiastn sequence analysis program using default parameters and a P-value cutoff threshold of 
-4 or -5 or lower, depending on the length of the query sequence. Putative transcription 
factor sequence hits were then screened to identify those containing particular sequence 
strings. If the sequence hits contained such sequence strings, the sequences were confirmed 
as transcription factors. 

For example, we identified a MADS box gene G157 within BAG F22K20 (GenBank 
accession AC002291) from Chromosome 1 that was predicted to encode a protein related to 
FLC. An 872bp cDNA clone for G157 was identified among clones isolated from a library 
derived from leaf mRNA. The encoded protein was 196 amino acids in length, and shared 
62% overall amino acid sequence identity with FLC, and 82% identity within the MADS DNA 
binding domain. 

G157 is also related to G859, G1842, G1843, and G1844 that map together as a 
tightly linked cluster, at the bottom of chromosome V, that occupies approximately 22 kb and 
spans three adjacent clones, MXK3, F1505, and MQN23 (GenBank accession numbers 
AB019236, AB026633, and AB013395, respectively). G859, G1842, G1843, and G1844 are 
all arranged in the same orientation. G859, G1842, G1843, and G1844 were likely created by 
a duplication event; this could have allowed their divergence into different aspects of gene 
regulation. Their physical proximity suggests that they may act as a unit controlled via 
common regulatory elements. 

The pair-wise comparisons of the 57 amino acid MADS domains of FLC, G157, G859, 
G1842, G1843, and G1844 are displayed in Table 1. The table shows percent amino acid 
sequence identity and, in parentheses, the sequence identity percentages when conservative 
amino acid substitutions are considered. The MADS domains of the proteins encoded by 
G859, G1842, G1843, and G1844 are highly conserved with those of FLC and G157: these 
proteins share from 75% to 91% of amino acid sequence identity, depending on the pair-wise 
comparison as shown below. When conservative amino substitutions are made, the MADS 
domains of these proteins are 88%-99% identical to each other (shown in parentheses). 
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Table 1 Percentage of amino acid identity in the MADS domain 



FLC (G1759) 



G157 G859 G1842 G1843 G1844 



FLC (G1759) 



100% 



82%(96%) 84%(94%) 77%(91%) 78%(99%) 75%(92%) 

100% 87%(95%) 89%(94%) 78%(95%) 78%(93%) 

100% 91%(94%) 77%(94%) 78%(92%) 

100% 77%(91% 78%(88%) 

100% 85%(92%) 



G157 
G859 
G1842 
G1843 
G1844 



100% 



Amino acid residue 30 of FLC and by G157, G859, G1842, G1843, and G1844 is an 
acidic residue (E or D) whereas, in all other Arabidopsis MADS domain proteins so far 
identified, that position is occupied by a positively charged lysine residue. The crystal 
structure of the human SRF MADS domain bound to DNA has shown that lysine residue 
(which is also conserved in yeast MCM1 and human MEF2A proteins) to contact the 
phosphate backbone of the DNA target site (Pellegrini et a/., (1995) Nature 376:490-498). 
That amino acid difference could therefore confer DNA binding properties to FLC and by 
G157, G859, G1842, G1843, and G1844 distinct from other Arabidopsis MADS domain - 
proteins. Therefore, MADS domain proteins with an acidic residue at position 30 may be 
particularly useful in modifying plant flowering time and vernalization response. 

The transcripts from these genes were analyzed by 3' RACE (Rapid Amplification 
cDNA Ends) and corresponding cDNAs were isolated by RT-PCR from mixed samples of 
Arabidopsis tissue (Columbia ecotype). During this analysis, it was found that G859, G1842 
and G1844 transcripts exist in multiple alternatively spliced forms. 

Example II. Flowering Time Associated Genes 

Reverse transcriptase PCR was done using gene specific primers within the coding 
region for each sequence identified. Where possible, the primers were designed near the 3' 
region of each coding sequence initially identified. 

Total RNA was isolated from plant tissue tissue and extracted using CTAB. Once 
extracted total RNA was normalized in concentration across all the tissue types to ensure that 
the PCR reaction for each tissue received the same amount of cDNA template using the 28S 
band as reference. Poly A+ was purified using a modified protocol from the Qiagen Oligotex 
kit batch protocol. cDNA was synthesized using standard protocols. After the first strand 
cDNA synthesis, primers for Actin 2 were used to normalize the concentration of cDNA across 
the tissue types. Actin 2 is found to be constitutively expressed in fairly equal levels across 
the Arabidopsis tissue types. 

For RT PCR, cDNA template was mixed with corresponding primers and Taq 
polymerase. Each reaction consisted of 0.2 ul cDNA template, 2ul 10X Tricine buffer, and 
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16.8 ul water, 5pmol Primer 1, 5pmol Primer 2, 0.3 ul Taq polymerase, 200uM dNTPs and 8.6 
ul water. 

The 96 well plate was covered with microfilm and set in the Thermocycler to start the 
following reaction cycle. Stepl 93° C for 3 mins, Step 2 93° C for 30 sec, Step 3 60-65° C 
5 for 1 min, Step 4 72° C for 2 mins,. Steps 2, 3 and 4 were repeated for 20-35 cycles, Step 5 

72° C for 5 mins and Step 6 4° C. The PCR plate was sometimes placed back in the 
thermocycler to amplify more products for 5-15 more cycles to identify genes that have very 
low expression. The reaction cycle was as follows: Step 2 93° C for 30 sec, Step 3 65° C for 1 
min, and Step 4 72° C for 2 ins, repeated for 8 cycles, and Step 4 4° C. 

10 Eight microliters of PCR product and 1 .5 ul of loading dye were loaded on a 1 .2% 

agarose gel for analysis between 21 and 36 cycles. Expression levels of specific transcripts 
were considered low if they were only detectable after 35 cycles of PCR. Expression levels 
were considered medium or high depending on the levels of transcript compared with 
observed transcript levels for actin2. 

15 As an example, to assess G157 mRNA levels in G157 plants, PCR was carried out 

over 25 cycles using primers 5'-GGCATAACCCTTATCGGAGATTTGAAGC-3' (SEQ ID No. 
57) and S'-ACACAAACTCTGATCTrGTCTCCGAAGG-S* (SEQ ID No. 58). To assess mRNA 
levels in different tissues extracted from wild type plants, 25 or 30 cycles of PCR were 
performed using primers S'-GCATAACCCTTATCGGAGATTTGAAGCCAT-S' (SEQ ID No. 59) 

20 and 5VVACATTCCTCTCTCATCATCTGTTGCCAGC-3' (SEQ ID No. 60). PCR for FLC was 

performed either with primers 5-AACGCTTAGTATCTCCGGCGACTTGAAC-3' (SEQ ID No. 
51) and 5'-CTCACACGAATAAGGTACAAAGTTCATC-3' (SEQ ID No. 62) over 35 cycles, or 
S'-TTAGTATCTCCGGCGACTTGAACCCAAACC-S 1 (SEQ ID No. 63) and 5- 
AGATTCTCAACAAGCTTCAACATGAGTTCG-3' (SEQ ID No. 64) over 30 cycles. Primer 

25 specificity was verified by sequencing RT-PCR products. Samples were standardized via 20- 

25 cycles of PCR with actin primers. 

Example III. Construction of Expression Vectors 

30 The sequence was amplified from a genomic or cDNA library using primers specific to 

sequences upstream and downstream of the coding region. The expression vector was 
pMEN20 or pMEN65, which are both derived from pMON316 (Sanders et al, (1987) Nucleic 
Acids Research 15:1543-58) and contain the CaMV 35S promoter to express transgenes. To 
clone the sequence into the vector, both pMEN20 and the amplified DNA fragment were 

35 digested separately with Sail and Notl restriction enzymes at 37° C for 2 hours. The digestion 

products were subject to electrophoresis in a 0.8% agarose gel and visualized by ethidium 
bromide staining. The DNA fragments containing the sequence and the linearized plasmid 
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were excised and purified by using a Qiaquick gel extraction kit (Qiagen, CA). The fragments 
of interest were ligated at a ratio of 3:1 (vector to insert). Ligation reactions using T4 DNA 
ligase (New England Biolabs, MA) were carried out at 16° C for 16 hours. The ligated DNAs 
were transformed into competent cells of the E. coli strain DHSalpha by using the heat shock 
method. The transformations were plated on LB plates containing 50 mg/l spectinomycin 
(Sigma). 

Individual colonies were grown overnight in five milliliters of LB broth containing 50 
mg/l spectinomycin at 37° C. Plasmid DNA was purified by using Qiaquick Mini Prep kits 
(Qiagen, CA). 

Example IV. Transformation of Agrobacterium with the Expression Vector 

After the plasmid vector containing the gene was constructed, the vector was used to 
transform Agrobacterium tumefaciens cells expressing the gene products. The stock of 
Agrobacterium tumefaciens cells for transformation were made as described by Nagel et al. 
FEMS Microbiol Letts 67: 325-328 (1990). Agrobacterium strain GV3101 was grown in 250 ml 
LB medium (Sigma) overnight at 28°C with shaking until an absorbance (Aeoo) of 0.5 - 1.0 was 
reached. Cells were harvested by centrifugation at 4,000 x g for 15 min at 4°C. Cells were 
then resuspended in 250 pi chilled buffer (1 mM HEPES, pH adjusted to 7.0 with KOH). Cells 
were centnfuged again as described above and resuspended in 125 pi chilled buffer. Cells 
were then centrifuged and resuspended two more times in the same HEPES buffer as 
described above at a volume of 100 pi and 750 pi, respectively. Resuspended cells were then 
distributed into 40 pi aliquots, quickly frozen in liquid nitrogen, and stored at -80° C. 

Agrobacterium cells were transformed with plasmids prepared as described above 
following the protocol described by Nagel et al. FEMS Microbiol Letts 67: 325-328 (1990). For 
each DNA construct to be transformed, 50-100 ng DNA (generally resuspended in 10 mM 
Tris-HCI, 1 mM EDTA, pH 8.0) was mixed with 40 pi of Agrobacterium cells. The DNA/cell 
mixture was then transferred to a chilled cuvette with a 2mm electrode gap and subject to a 
2.5 kV charge dissipated at 25 pF and 200 pF using a Gene Pulser II apparatus (Bio-Rad). 
After electroporation, cells were immediately resuspended in 1.0 ml LB and allowed to recover 
without antibiotic selection for 2 - 4 hours at 28° C in a shaking incubator. After recovery, cells 
were plated onto selective medium of LB broth containing 100 pg/ml spectinomycin (Sigma) 
and incubated for 24-48 hours at 28° C. Single colonies were then picked and inoculated in 
fresh medium. The integrity of the plasmid construct was verified by PCR amplification and 
sequence analysis. 
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Example V. Transformation of Arabidopsis Plants with Agrobacterium tumefaciens 
with Expression Vector 

After transformation of Agrobacterium tumefaciens with plasmid vectors containing the 
gene, single Agrobacterium colonies were identified, propagated, and used to transform 
Arabidopsis plants. Briefly, 500 ml cultures of LB medium containing 50 mg/l spectinomycin 
were inoculated with the colonies and grown at 28° C with shaking for 2 days until an 
absorbance (A^o) of > 2.0 is reached. Cells were then harvested by centrifugation at 4,000 x 
g for 10 min, and resuspended in infiltration medium (1/2 X Murashige and Skoog salts 
(Sigma), 1 X Gamborg's B-5 vitamins (Sigma), 5.0% (w/v) sucrose (Sigma), 0.044 
benzylamino purine (Sigma), 200 pl/L Silwet L-77 (Lehle Seeds) until an absorbance (A^q) of 
0.8 was reached. 

Prior to transformation, Arabidopsis thaiiana seeds (ecotype Columbia) were sown at 
a density of -10 plants per 4" pot onto Pro-Mix BX potting medium (Hummert International) 
covered with fiberglass mesh (18 mm X 16 mm). Plants were grown under continuous 
illumination (50-75 pE/m 2 /sec) at 22-23° C with 65-70% relative humidity. After about 4 
weeks, primary inflorescence stems (bolts) are cut off to encourage growth of multiple 
secondary bolts. After flowering of the mature secondary bolts, plants were prepared for 
transformation by removal of all siliques and opened flowers. 

The pots were then immersed upside down in the mixture of Agrobacterium infiltration 
medium as described above for 30 sec, and placed on their sides to allow draining into a 1* x 
2' flat surface covered with plastic wrap. After 24 h, the plastic wrap was removed and pots 
are turned upright. The immersion procedure was repeated one week later, for a total of two 
immersions per pot. Seeds were then collected from each transformation pot and analyzed 
following the protocol described below. 

Example VI. Identification of Arabidopsis Primary Transformants 

Seeds collected from the transformation pots were sterilized essentially as follows. 
Seeds were dispersed into in a solution containing 0.1% (v/v) Triton X-100 (Sigma) and sterile 
H 2 0 and washed by shaking the suspension for 20 min. The wash solution was then drained 
and replaced with fresh wash solution to wash the seeds for 20 min with shaking. After 
removal of the second wash solution, a solution containing 0.1% (v/v) Triton X-100 and 70% 
ethanol (Equistar) was added to the seeds and the suspension was shaken for 5 min. After 
removal of the ethanol/detergent solution, a solution containing 0.1% (v/v) Triton X-100 and 
30% (v/v) bleach (Clorox) was added to the seeds, and the suspension was shaken for 10 
min. After removal of the bleach/detergent solution, seeds were then washed five times in 
sterile distilled H 2 0. The seeds were stored in the last wash water at 4° C for 2 days in the 
dark before being plated onto antibiotic selection medium (1 X Murashige and Skoog salts (pH 
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adjusted to 5.7 with 1M KOH), 1 X Gamborg's B-5 vitamins, 0.9% phytagar (Life 
Technologies), and 50 mg/l kanamycin). Seeds were germinated under continuous 
illumination (50-75 pE/m 2 /sec) at 22-23° C. After 7-10 days of growth under these conditions, 
kanamycin resistant primary transformants (T, generation) were visible and obtained. These 
seedlings were transferred first to fresh selection plates where the seedlings continued to 
grow for 3-5 more days, and then to soil (Pro-Mix BX potting medium). 
Primary transformants are self-crossed and progeny seeds (T2) collected. T2 progeny seeds 
were germinated on kanamycin as described above and kanamycin resistant seedlings were 
selected, transferred to soil and analyzed. 

Example VII. Analysis of transgenic Arabidopsis plants 

In a first experiment, G157 plants (ie plants expressing the G157 transgene) were 
grown in 12 hours light. 31 of 40 lines flowered earlier than control plants transformed with a 
control vector. Mean rosette leaf number of early T1 lines was 12.4+/-0.8 whereas control 
lines had 27+/-1 .2 rosette leaves. 2 of 40 T1 plants flowered at the same time as controls and 
7 of 40 lines were late flowering and produced visible inflorescences 2 to 3 weeks after wild 
type. 

In further experiments, plants were grown under conditions of 24 hours light at 20-25 
°C. Under these conditions, the non-transformed control plants produced a mean total of 
14.3+/-0.7 leaves on the primary shoots prior to flower bud initiation. Flower buds were first 
visible on these plants at a mean of 21.1+/-0.5 days after sowing (error values represent 
standard error of the mean to which 95% confidence limits have been attached). For G859, 
14/19 T1 plants were early flowering (mean leaf total of 6.4+/-0.7, flower buds visible at 
12.9+/-0.7days after sowing), 3/19 were wild type, and 2/19 were slightly late flowering 
compared to wild type (mean total of 19 leaves, flower buds visible at 27 days). RT 
expression studies revealed that the late flowering individuals possessed the highest levels of 
transgene expression. These results strongly parallel those obtained for G1 57. For G1842, 
7/10 T1 flowered early (mean total of 7.9+/-0.6 leaves, flower buds visible at 13.9+/-1.0 days), 
and 3/10 plants were wild type. Overexpression studies were also performed with cDNAs 
encoding shortened splice variants of G1842. For G1842.2 (encodes a 185 amino acid splice 
variant), 15/18 T1 plants flowered early (mean total of 6.9+/-0.9 leaves, flower buds visible at 
14.5+/-0.6 days) and 3/18 were wild type. For G1842.6 (encodes a 77 amino acid splice 
variant), 8/10 T1 plants flowered early mean total of 6.8+/-1.6 leaves, flower buds visible at 
13.9+/-0.9 days) and 2/10 were wild type. For G1842.7 (encodes a 118 amino acid splice 
variant) 8/10 T1 plants flowered early (accurate leaf counts not made)"and 2/10 were wild 
type. Thus, the G1842 splice variants produced comparable effects to the full-length cDNA 
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clone when over-expressed. For G1843, 7/1 1 flowered early (mean total of 6.4+/-0.5 leaves, 
flower buds visible at 16.0+/-1 .6 days) and 2/1 1 had a wild type flowering time. The G1843 T1 
plants, however, were dwarfed and showed retarded development of some organs. This 
suggests that G1843 has unpredicted toxic effects when over-expressed. For G1844, 6/10 T1 
5 plants flowered early (mean total of 6.8+/- 1 .7 leaves, flower buds visible at 14.7+/-1 .3 days) 

and 4/10 plants were wild type. Overexpression studies were also performed with a cDNA 
encoding a shortened splice variant of G1844. For overexpression of G1844.2 (encodes a 
184 amino acid splice variant), 6/19 T1 plants flowered early (mean total of 7.8+/-1.7 leaves, 
flower buds visible at 15.7+/-1.3 days) and 13/19 were wild type). The over-expression data 

10 for G859, G1842, G1843, and G1844 support the hypothesis that they have a role in the 

control of flowering time. 

RT-PCR was performed on materials from G157 plants using G157 specific primers at 
approximately 25 cycles. The highest levels of G157 expression were detected in late 
flowering individual plants or in samples from pooled seedlings that contained late flowering 

15 individuals. Plants that showed only moderate or low levels of overexpression compared to 

wild type were slightly early flowering or normal. 

To test whether an increase in G157 could affect flowering time in late flowering 
ecotypes of Arabidopsis, we overexpressed G157 in the late flowering ecotypes Stockholm 
and Piztal. In this experiment, 32 primary transformants from each ecotype were grown 

20 interspersed with controls under continuous light conditions, in both ecotypes, around 50% of 

the transformants flowered earlier than controls, and in some transformants the time to 
flowering was halved. As was observed with Columbia G157 plants, a minority of Pitztal and 
Stockholm transformants were clearly later flowering compared to controls. 

A correlation between G1 57 transgene expression and flowering time was also 

25 observed in G157 Stockholm and Pitztal T1 plants. RT-PCR was performed with two early 

and two late flowering lines in each background. Again, the late flowering lines contained the 
higher levels of G157 expression. Thus, the factor appears to affect flowering time in a 
quantitative manner; a modestjeve! of overexpression triggers early flowering, whereas a 
larger increase delays flowering. 

30 In conclusion, over-expression of G157 or any of the related genes modifies flowering 

time in plants: a modest level of over-expression triggers early flowering, whereas a larger 
increase delays flowering. 

Using similar or identical methodologies described in the examples above, further 
Arabidopsis genes were identified whose altered expression was correlated with delayed or 

35 accelerated flowering. These genes are tabulated in Table 2 with their Sequence Listing Nos., 

and their effects on flowering time. 
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Table 2. Further Arabidopsis genes for manipulating flowering time 



OtU IU (NOS. 


«ene 


observations 


23, 24 


G861 


early or late flowering 


25, 26 


G861.1 


early or late flowering 


29, 30 


G192 


late flowering 


31,32 


G234 


late flowering 


33, 34 


G361 


late flowering 


35, 36 


G486 


late flowering 


37, 38 


G748 


late flowering 


39, 40 


G994 


late flowering 


41,42 


G1335 


late flowering 


43, 44 


G562 


late flowering 


45, 46 


G736 


late flowering 


47,48 


G1073 


late flowering 


49, 50 


G1435 


late flowering 


51,52 


G180 


early flowering 


53, 54 


G592 


early flowering 


55, 56 


G208 


early flowering- 



The vernalization response was also investigated. Late flowering vernalization 
responsive ecotypes and mutants have high steady state levels of FLC transcript, which 
decrease during the promotion of flowering by vernalization (Michaels and Amasino, (1999) 
Plant Cell 11:949-956; Sheldon et al., (1999) Plant Cell 11:445-458; Sheldon et al., (2000) 
Proc. Natl. Acad. Sc/. 97: 3735-3758). In contrast to FLC, G157 transcript levels show no 
consistent correlation with the vernalization response in the late flowering Stockholm and 
Pitztal ecotypes. Additionally we found that over-expression of G 157 did not influence FLC 
levels. The effects of vernalization on expression of G861, G859, G1842, G1843, and G1844 
were also examined. Germinating seeds of Columbia, Pitztal, Stockholm, constans-1, and 
fca-9 were vernalized on MS agar plates in a 4°C cold room for 8 weeks, and then transferred 
to a continuous light growth room. Total tissues from the vernalized seedlings, and freshly 
sown non-vernalized controls were harvested at 9 days after the transfer. RT-PCR was 
performed for FLC, G157, G859, G1842, G1843, G1844, and G861, and actin. Compared to 
FLC and G157, none of the genes showed a clear consistent decline upon vernalization in the 
five different sample sets. However, G1844 displayed a converse pattern of expression to 
FLC: G1844 levels consistently increased on vernalization. This is particularly significant as it 
directly implicates G1844 in control of the vernalization response. Thus G1844 likely activates 
flowering and has an opposing role to FLC. 



28 



WO 01/26459 



PCT/US00/28141 



To explore whether overexpression of G157 produces comparable effects on 
vernalization, batches of wild type Pitztal and Stockholm seedlings were cold treated for 6 
weeks at 4°C, then grown amongst a second selection of G157 T1 Pitztal, G157 T1 Stockholm 
and non-vernalized wild type plants. As expected, vernalization markedly and uniformly 
reduced flowering time in both Pitztal and Stockholm wild type plants. Amongst the G157 
Stockholm lines, the earliest flowering T1 group (8/23 lines) was indistinguishable from 
vernalized plants. For Pitztal, however, the early flowering T1 plants were on average 
marginally later than the vernalized plants. Therefore, overexpression of G157 can 
substantially reduce the requirement for vernalization in late flowering ecotypes. 

Additionally, we observed that the late flowering of G157 lines is independent of FLC 
expression and does not respond to vernalization. However, the late flowering G157 plants 
are responsive to photoperiod. In an experiment conducted under short day conditions of 8 
hours of light, we obtained a number of G157 Columbia T1 plants that flowered up to a month 
later than wild type controls (data not shown). To confirm that the late flowering effects caused 
by G157 overexpression were independent of FLC transcription, we tested whether late 
flowering G157 Columbia plants were responsive to vernalization. No significant change in 
flowering time was noted: in continuous light conditions, vernalized T2 plants of line 4 had a 
total of 31.3 +/- 1.8 leaves compared to 30.1 +/- 1.3 when non-vernalized. Control fca plants 
verified that the treatment was effective: vernalized plants flowered after only 10.3 +/- 0.9 
leaves compared to more than 40 leaves for the non-vernalized controls. Thus, the late 
flowering phenotype caused by G157 could not be overcome by vernalization, as would be 
expected if the delay occurred independently of changes in FLC expression 

Example IX. Identification of Homologous Sequences 

Homologs from plant species other than Arabidopsis were identified using database 
sequence search tools, such as the Basic Local Alignment Search Tool (BLAST) (Altschul et 
al. (1990) J. Mol. Biol. 215:403-410; and Altschul et al. (1997) Nucl. Acid Res. 25: 3389-3402). 
The tblastx sequence analysis programs were employed using the BLOSUM-62 scoring matrix 
(Henikoff, S. and Henikoff, J. G. (1992) Proc. Natl. Acad. Sci. USA 89: 10915-10919). 

The entire NCBI Genbank database was filtered for sequences from all plants except 
Arabidopsis thatiana by selecting all entries in the NCBI Genbank database associated with 
NCBI taxonomic ID 33090 (Viridiplantae; all plants) and excluding entries associated with 
taxonomic ID 3701 (Arabidopsis thaliana). These sequences were compared to sequences 
representing genes of SEQ IDs 1-56 on 9/26/2000 using the Washington University TBLASTX 
algorithm (version 2.0a19MP). For each gene of SEQ IDs 1-56, individual comparisons were 
ordered by probability score (P-value), where the score reflects the probability that a particular 
alignment occurred by chance. For example, a score of 3.6e-40 is 3.6 x 10" 40 . For up to ten 
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species, the gene with the lowest P-value (and therefore the most likely homolog) is listed in 
Figure 2. 

In addition to P-values, comparisons were also scored by percentage identity. 
Percentage identity reflects the degree to which two segments of DNA or protein are identical 
over a particular length. The ranges of percent identity between the non-Arabidopsis genes 
shown in Figure 2 and the Arabidopsis genes in the sequence listing are: SEQ ID No. 1: 54%- 
67%; SEQ ID Nos. 3,5,7: 37%-47%; SEQ ID Nos. 9,11,13.15: 54%-62%; SEQ ID No. 17: 
62%-71%; SEQ ID Nos. 19, 21: 50%-67%; SEQ ID Nos. 23.25: 75%-91%; SEQ ID No. 27: 
46%-69%; SEQ ID No. 29: 44%-90%; SEQ ID No. 31: 57-89%; SEQ ID No. 33: 37%-79%; 
SEQ ID No. 35: 50%-71%; SEQ ID No. 37: 39%-63%; SEQ ID No. 39: 58%-70%; SEQ ID No. 
41: 45%-73%; SEQ ID No. 43: 42%-84%; SEQ ID No. 45: 47%-81%; SEQ ID No. 47: 31%- 
71%; SEQ ID No. 49: 40%-67%; SEQ ID No. 51: 69%-51%; SEQ ID No. 53: 43%-86%; and 
SEQ ID No. 55: 79%-89%. 

Arabidopsis homologs of genes in Table 2 were also identifed using BLAST. These 
genes are found in the following Arabidopsis BAC sequences, identified by their Genbank 
sequence NID numbers: 2827698 (G234 homolog), 3241917 (G748 homolog), 2618604 
(G994 homolog), 6598548 (G1335 homolog), 7340331 (G736 homolog), 6523051 (G1435 
homolog), 6598491 (G208 homolog) and 3172156 (G208 homolog). 

All references (publications and patents) are incorporated herein by reference in their 
entirety for all purposes. 

Although the invention has been described with reference to the embodiments and 
examples above, it should be understood that various modifications can be made without 
departing from the spirit of the invention. Accordingly, the invention is limited only by the 
following claims. 
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cDNA 
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protein 
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cDNA 
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protein 
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cDNA 
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G859.1 


protein 


2-57 
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cDNA 
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protein 


2-57 
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G1842 


cDNA 




10 


G1842 


protein 


2-57 


11 


G 1842.2 


cDNA 




12 


G 1842.2 


protein 


2-57 


13 


G1 842.6 


cDNA 




14 


G 1842.6 


protein 


2-57 


15 


G 1842.7 


cDNA 




16 


G 1842.7 


protein 


2-57 


17 


G1843 


cDNA 




18 


G1843 


protein 


2-57 


19 


G1844 


cDNA 




20 


G1844 


protein 


2-57 • 


21 


G 1844.2 


cDNA 




22 


G 1844.2 


protein 


2-57 


23 


G861 


cDNA 




24 


G861 


protein 


2-57 


25 


G861.1 


cDNA 




26 


G861.1 


protein 


2-57 


27 


G1759 


cDNA 




28 


G1759 


protein 


2-57 


29 


G192 


cDNA 




30 


G192 


protein 


128-185 


31 


G234 


cDNA 




32 


G234 


protein 


14-115 


33 


G361 


cDNA 




34 


G361 


protein 
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G486 


cDNA 
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G486 


protein 
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cDNA 
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protein 
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cDNA 
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protein 
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cDNA 
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protein 
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cDNA 
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protein 
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protein 
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cDNA 
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protein 
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protein 
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53 
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cDNA 




54 
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protein • 
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55 


G208 


cDNA 




56 
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protein 
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Figure 2A 
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Genbank NID P-value 
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6.00E-13 


IVlCVJlVsay 1/ lIUIIvvdLUIa 





f^1 ^7 


8707754 


1.40E-12 


nurucufii vuiyarti 





r^i ^7 


2293891 


1.40E-12 


Poti tnia v hx/KriHa 
rciuiMd A iiyuiiua 







19870 


1.40E-12 


iNiVrfUiidf la lauavUiii 


1 


1 ^7 
0 10/ 


7628118 


3.70E-12 


r^r»c c\/ni i im arhorenm 
vjuooyfJiui 1 1 diuvjicuiii 





H1^7 


5050220 


3.80E-12 


vjuooyfjiui i ' iiiiouiuiii 





O (Of 


9414215 


4.50E-12 




^67 

0,0, f 




6530836 


1.40E-34 


i_y uvjfjci oiVrfVji i cav/uici uui 1 1 


^ *5 7 
O,o, f 


VJOO«7 


5777903 


4.70E-30 


Mali !<; HnmpQtir 1 ?! 


*5 7 




9367312 


7.10E-30 


Hnrrlpi im x/nlnsrp 
riuivjcuiii vuiycaic 


^ ^ 7 
O.O, / 


oooy 


6467973 


3.60E-29 


RnnH ro hi i i im nroY Marlamo Thnnn.lM 

L/trr lUf uujuih yit?A iviouainc iuu(iy-iiN 


1^7 
0,0, f 


oooy 


4204233 


1.20E-28 


1 nliitm lomi ilonti im 


0,0, / 




939784 


2.50E-28 


ccd rnayb 


7 ^ 7 


oooy 


6651032 


3.10E-28 


v^apoicum dnnuum 


^ ^ 7 
0,0,/ 


oooy 


1483227 


4.60E-28 


D^tiilo nan/Hi 1 1 o 
OclUla pcMUUIa 


^ ^ 7 
0,0,/ 


oooy 


5295983 


8.70E-28 


vjryza oouVd 


0,0, / 


oooy 


5070137 


1.10E-27 


iNicoiiana syivesins 


y,i i t i o, io 


c-\ QAO 
ol OHZ 


6530836 


5.90E-19 


Lycoperstcon escuienium 


y, i i , i o, i o 


fii fl/i9 

VJ 1 OH^ 


5606765 


8.00E-15 


vjiycine max 


Q 1 1 1 1 ^ 

y, i i , i o, i o 


o i o*+z 


6826955 


1.20E-12 


£-Qa mays 


y,T i ,i o, i o 


u 1(54^ 


4979250 


1.50E-11 


VjiyZa Saliva 


y, i i , i o, i o 


O lOH^ 


6536942 


1.50E-11 


IvloUfUayU uUnualUla 


y, i i , i o, i u 


f^1 R/19 
vj 1 OH^ 


7501504 


4.00E-1 1 


vjussypiurii aruuicuin 


Q 1 1 1 1 ^ 

y , i 1,10,10 


O I OH^ 


9444818 


4.70E-11 


i nitcum dcoiivuiii 


Q 11 1 O 1 C 

y, i i , i o, i o 


o I OH^ 


5859176 


5.40E-1 1 


"iriUb IdcUd t 


q 11 1 *a 1 e; 

9, 1 1 , IO, l<J 


(X\RA0 


5777905 


6.80E-11 


IVIalUo UUIIICoLILra 


q 11 1 ^ 1 s 

U, I 1 r I O, I vJ 




6647105 


6.80E-11 


Macomhrv/anfhomi im rrv/ctallini im 
ivicoc?! 1 iui yai in ici I iui 1 1 v^i ydiaiiii tui 1 1 


17 
1 1 




8707754 


6.60E-15 


WnrHpiim v/iilnarp ' 

1 l\J\ UCUI 1 1 VUIUQI C 


17 




5606765 


1.10E-14 


f^lv//*inp may 
vjiywii ic uiaA 


17 


fi1 A4^ 


4387730 


1.50E-14 




17 


f^1 

VJ I 040 


3646325 


1.60E-14 


ividiuo uvji i icoiioa 


17 


VJ I OHO 


7625048 


1.60E-14 


r^nccv/niiim arhnrpitm 
vjuooypiui j i aiuuicuiii 


17 


r;i O/io 
vj 1 OHO 


5050220 


1.80E-14 


vjosoypiuin iifibuiuin 


1 7 


f51 AA^ 
vj 1 Oho 


9429009 


3.00E-14 


1 riUCUiTl aCollVUlTI 


1 7 


f21 QA"3 
O 1 Oho 


7145381 


6.30E-14 


/LGa mays 


1 7 

1 / 


rsi q^o 
vj l oho 


3824730 


8.20E-14 


vjiyia Saliva 


17 


vj I OHO 


4528048 


2.30E-13 


V^lUUb UllbillU 


1 Q *91 

i y, z i 


ri 1 OA A 

VJ I OHH 


5606765 


1.70E-14 


VJiyClMC 1 1 laA 


1 Q 91 

i y, c. i 


("51 AAA 
vj I OHH 


8707754 


3.30E-13 


nuraeurn vuiyaf c 


1Q 91 

i y , z i 


rsi QAA 
vj I OHH 


9429009 


4.40E-13 


1 MllLiLJMI aCbllVUIII 


10 91 
I y, ^ 1 


f5l AAA 
vj I OHH 


4979250 


1.00E-12 
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G1844 
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GossvDium arboreum 
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G1844 


5050220 


1.20E-12 


Gossypium hirsutum 


19,21 


G1844 


6530836 


1.40E-12 


Lycopersicon esculentum 


19,21 


G1844 


6918768 


1.70E-12 


Zea mays 


19, 21 


G1844 


6536942 


3.50E-12 


Medicago truncatula 


19,21 


G1844 


2252481 


3.70E-12 


Ceratopteris richardii 


23,25 


G861 


5601313 


8.20E-49 


Lycopersicon esculentum 


23,25 


G861 


2735763 


1.50E-37 


Solarium tuberosum 


23,25 


G861 


6652755 


5.40E-37 


Paulownia kawakamii 
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Figure 2B 



SEQ IDs 


Gene Ids 


Genbank NID 


P-value 


Species 


23,25 


G861 


9367233 


3.30E-33 


Hordeum vulgare 


23,25 


G861 


7672990 


8.90E-29 


Canavaiia lineata 


23,25 


G861 


3986688 


5.20E-26 


Cichorium intybus 


23,25 


G861 


7552197 


2.40E-25 


Sorghum bicolor 


23,25 


G861 


5295977 


4.90E-24 


Oryza sativa 


23,25 


G861 


9194959 


3.60E-19 


Medicago truncatula 


23,25 


G861 


3855425 


4.40E-19 


Populus tremula x Populus tremuloides 


27 


G1759 


5606765 


4.60E-16 


Glycine max 


27 


G1759 


7647685 


4.10E-15 


Lycopersicon escuientum 


27 


G1759 


4979250 


2.70E-14 


Oryza sativa 


27 


G1759 


8707754 


6.30E-14 


Hordeum vulgare 


27 


G1759 


5777905 


6.80E-14 


Malus domestica 


27 


G1759 


7626240 


1.10E-13 


Gossypium arboreum 


27 


G1759 


5047371 


1.10E-13 


Gossypium hirsutum 


27 


G1759 


6918768 


1.20E-13 


Zea mays 


27 


G1759 


8574456 


1.30E-13 


Capsicum annuum 


27 


G1759 


8216956 


1.30E-13 


Cucumis sativus 


29 


G192 


7284340 


3.60E-40 


Glycine max 


29 


G192 


7779802 


1.10E-39 


Lotus japonicus 


29 


G192 


9361307 


9.40E-28 


Triticum aestivum 


29 


G192 


7340336 


8.10E-24 


Oryza sativa 


29 


G192 


6529152 


4.70E-23 


Lycopersicon escuientum 


29 


G192 


7206269 


2.90E-22 


Medicago truncatula 


29 


G192 


4886128 


4.50E-15 


Zea mays 


29 


G192 


8706346 


4.70E-13 


Hordeum vulgare 


29 


G192 


9302479 


8.80E-13 


Sorghum bicolor 


29 


G192 


3326241 


2.40E-12 


Gossypium hirsutum 


31 


G234 


9193243 


7.50E-60 


Medicago truncatula 


31 


G234 


926451 1 


3.30E-57 


Glycine max 


31 


G234 


7412424 


3.60E-49 


Lycopersicon escuientum 


31 


G234 


8335078 


2.60E-48 


Oryza sativa 


31 


G234 


7218651 


1.00E-42 


Sorghum bicolor 


31 


G234 


9364630 


9.90E-40 


Triticum aestivum 


31 


G234 


6079814 


5.10E-36 


Gossypium arboreum 


31 


G234 


9252441 


5.40E-35 


Solanum tuberosum 


31 


G234 


5860031 


1.00E-33 


Pinus taeda 


31 


G234 


5050757 


2.60E-33 


Gossypium hirsutum 


33 


G361 


7561045 


2.30E-21 


Medicago truncatula 


33 


G361 


9307604 


1.20E-17 


Sorghum bicolor 


33 


G361 


4119050 


1.70E-13 


Oryza sativa 


33 


G361 


8175037 


7.30E-13 


Hordeum vulgare 


33 


G361 


8329902 


5.30E-09 


Mesembryanthemum crystallinum 


33 


G361 


6534259 


1.20E-08 


Lycopersicon escuientum 


33 


G361 


7283798 


1.30E-08 


Glycine max 


33 


G361 


3854369 


5.50E-08 


Populus tremula x Populus tremuloides 


33 


G361 


9365078 


1 .70E-07 


Triticum aestivum 


33 


G361 


5268965 


0.00023 


Zea mays 


35 


G486 


6845875 


3.10E-36 


Glycine max 


35 


G486 


8172030 


4.20E-29 


Medicago truncatula 


35 


G486 


9416562 


6.40E-29 


Triticum aestivum 


35 


G486 


5050127 


4.90E-28 


Gossypium hirsutum 


35 


G486 


7628400 


6.10E-28 


Gossypium arboreum 


35 


G486 


7781090 


2.10E-27 


Lotus japonicus 
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Figure 2C 









P-value Species 


35 


G486 


223/y 


3.40E-27 


Zea mays 


35 


G486 


O /I A -A 17C 


4.80E-27 


Chlamydomonas reinhardtii 


35 


G486 


/4uyoio 


1.30E-26 


Lycopersicon esculentum 


o c 

35 


G486 


Ofl7 4 ceo 

80/ 1558 


1.40E-26 


Solanum tuberosum 


37 


G748 


853689 


5.60E-87 


Cucurbita maxima 


Of 


G748 


f ii4ZOy f 


3.10E-59 


Oryza sativa 


17 

or 


G748 


coooccn 
OOOODOU 


9.70E-46 


Lycopersicon esculentum 


or 


G748 


CI A 4 ccc 
D041 OOO 


4.50E-38 


Glycine max 


37 


G748 


r\ a r\r\A Af\ 

yiyui40 


2.90E-35 


Medicago truncatula 


37 


G748 


7C1C77C 

7535776 


4.00E-33 


Sorghum bicolor 


37 


G748 


C\ A A O Af\ A 

9419494 


1.70E-31 


Hordeum vulgare 


37 


G748 


r\ a a r\ a c ~7 

9410157 


8.20E-29 


Triticum aestivum 


37 


G748 


3929324 


3.50E-25 


Dendrobium grex Madame Thong-IN 


37 


G748 


6020953 


7.30E-21 


Zea mays 


39 


G994 


6651291 


1.50E-55 


Pimpineila brachycarpa 


39 


G994 


7561750 


5.60E-51 


Medicago truncatula 


39 


G994 


5268844 


2.10E-50 


Zea mays 


39 


G994 


1430845 


3.10E-50 


Lycopersicon esculentum 


39 


G994 


1 945282 


5.40E-49 


Oryza sativa 


39 


G994 


22637 


1.40E-46 


Physcomitrella patens 


39 


G994 


7626566 


4.40E-44 


Gossypium arboreum 


o r\ 

39 


G994 


2921339 


4.50E-44 


Gossypium hirsutum 


39 


G994 


7590249 


3.60E-43 


Glycine max 


39 


G994 


20562 


6.30E-43 


Petunia x hybrida 


A A 

41 


G1335 


19742 


8.40E-63 


Nicotiana sylvestris 


41 


G1335 


5398738 


1.20E-59 


Zea mays 


41 


/■> j nor 

G1335 


9361467 


1.40E-50 


Triticum aestivum 


>• A 

41 


G1335 


8330366 


1.60E-48 


Mesembryanthemum crystallinum 


41 


G1335 


81 74823 


7.50E-43 


Hordeum vulgare 


A 4 
41 


G1335 


ccnccno 

boyoo^o 


8.00E-42 


Pinus taeda 


41 


G1335 


7721 1UU 


1.20E-39 


Lotus japonicus 


A A 
41 


G1335 


75021 ro 


2.60E-37 


Gossypium arboreum 


j4 A 

41 


G1335 


A QA~TA 70 

1817176 


5.60E-36 


Pinus radiata 


41 


G1335 


7CCAft70 

7550978 


3.30E-35 


Sorghum bicolor 


A O 

43 


G562 


1399004 


6.60E-142 


Brassica napus 


43 


G562 


COO H O A n 

5381310 


6.80E-53 


Catharanthus roseus 


A O 

43 


^ r ^o 

G562 


169958 


3.80E-45 


Glycine max 


43 


G562 


2879779 


3.60E-43 


Spinacia oleracea 


43 


G562 


7565950 


2.10E-41 


Medicago truncatula 


43 


G562 


728627 


4.50E-41 


Nicotiana tabacum 


43 


G562 


4 c cn c o 

1 155053 


2.30E-40 


Phaseolus vulgaris 


43 


G562 


1498300 


5.70E-40 


Petroselinum crispum 


43 


G562 


5046889 


6.70E-34 


Gossypium hirsutum 


43 


G562 


8328888 


2.60E-25 


Mesembryanthemum crystallinum 


40 


Of OO 


/ 4uyo2 f 


1.40E-37 


Lycopersicon esculentum 


45 


G736 


9197391 


5.60E-32 


Medicago truncatula 


45 


G736 


9419494 


4.70E-27 


Hordeum vulgare 


45 


G736 


7328718 


1.30E-25 


Oryza sativa 


45 


G736 


9410157 


1 .80E-25 


Triticum aestivum 


45 


G736 


853689 


5.20E-25 


Cucurbita maxima 


45 


G736 


7535776 


6.60E-25 


Sorghum bicolor 


45 


G736 


3929324 


4.70E-21 


Dendrobium grex Madame Thong-IN 


45 


G736 


2393774 


9.60E-20 


Zea mays 
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Figure 2D 



SEQ IDs 


Gene Ids 


Genbank NID 


P-value 


Species 


45 


G736 


7624398 


1.10E-19 


Gossypiurn arboreum 


47 


G1073 


7718401 


2.20E-55 


Medicago truncatuta 


47 


G1073 


6846994 


2.50E-44 


Glycine max 


47 


G1073 


7615218 


1 .60E-42 


Lotus japonicus 


47 


G1073 


7333102 


2.70E-34 


Lycopersicon esculentum 


47 


G1073 


9445090 


3.40E-25 


Triticum aestivum 


47 


G1073 


9252370 


2.20E-24 


Solanum tuberosum 


47 


G1073 


5042437 


4.60E-21 


Oryza sativa 


47 


G1073 


7536402 


5.30E-20 


Sorghum bicolor 


47 


G1073 


2213535 


7.30E-19 


Pisum sativum 


47 


G1073 


7624850 


2.10E-18 


Gossypiurn arboreum 


49 


G1435 


9203811 


3.70E-37 


Glycine max 


49 


G1435 


9430136 


4.10E-35 


Lycopersicon esculentum 


49 


G1435 


8904354 


4.30E-32 


Hordeum vulgare 


49 


G1435 


5050706 


3.30E-26 


Gossypiurn hirsutum 


49 . 


G1435 


7614196 


6.40E-19 


Lotus japonicus 


49 


G1435 


7551484 


1.00E-18 


Sorghum bicolor 


49 


G1435 


6916552 


7.20E-12 


Lycopersicon pennellii 


49 


G1435 


2443007 


5.50E-11 . 


Oryza sativa 


49 


G1435 


9255229 


1.30E-10 


Zea mays 


49 


G1435 


7766737 


2.80E-10 


Medicago truncatula 


51 


G180 


8468047 


1.90E-35 


Oryza sativa 


51 


G180 


7559831 


1.20E-24 


Medicago truncatula 


51 


G180 


5272716 


9.90E-24 


Lycopersicon esculentum 


51 


G180 


9187621 


3.30E-23 


Solanum tuberosum 


51 


G180 


6566312 


1.30E-22 


Glycine max 


51 


G180 


9304207 


1.30E-21 


Sorghum bicolor 


51 


G180 


7721184 


1.30E-20 


Lotus japonicus 


51 


G180 


9444636 


3.10E-19 


Triticum aestivum 


51 


G180 


3220212 


5.20E-19 


Gossypiurn hirsutum 


51 


G180 


1159876 


8.00E-19 


Avena fatua 


53 


G592 


7924069 


7.10E-27 


Glycine max 


53 


G592 


5896650 


1.10E-22 


Lycopersicon esculentum 


53 


G592 


6279773 


1.10E-17 


Lycopersicon pennellii 


53 


G592 


9364330 


1.20E-14 


Triticum aestivum 


53 


G592 


6166282 


5.40E-14 


Pinus taeda 


53 


G592 


8367093 


1.60E-12 


Zea mays 


53 


G592 


9301543 


6.60E-11 


Sorghum bicolor 


53 


G592 


7562632 


2.80E-10 


Medicago truncatula 


53 


G592 


702652 


5.80E-05 


Oryza sativa 


53 


G592 


7322923 


0.094 


Lycopersicon hirsutum 


55 


G208 


437326 


2.80E-65 


Gossypiurn hirsutum 


55 


G208 


7765706 


4.40E-64 


Medicago truncatula 


55 


G208 


5269878 


5.80E-64 


Lycopersicon esculentum 


55 


G208 


19054 


6.90E-63 


Hordeum vulgare 


55 


G208 


2605616 


1.00E-62 


Oryza sativa 


55 


G208 


7626566 


3.50E-62 


Gossypiurn arboreum 


55 


G208 


6667606 


4.10E-62 


Glycine max 


55 


G208 


517492 


1.80E-60 


Zea mays 


55 


G208 


9302672 


2.40E-57 


Sorghum bicolor 


55 


G208 


5860031 


1 .30E-54 


Pinus taeda 
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We Claim: 

1. A transgenic plant comprising a recombinant polynucleotide comprising a nucleotide 
sequence encoding a polypeptide comprising at least 6 consecutive amino acids of a 
sequence selected from the group consisting of SEQ ID Nos. 2N, where N=1-28 but excluding 
SEQ ID No. 28, wherein said transgenic plant has (i) a modified flowering time compared with 
another plant lacking the recombinant polynucleotide or (ii) a modified vernalization 
requirement compared with another plant lacking the recombinant polynucleotide. 

2. The transgenic plant of claim 1, wherein the nucleotide sequence encodes a polypeptide 
comprising a conserved domain selected from the group consisting of conserved domains of 
SEQ ID Nos. 2N, where N=1-28. 

3. The transgenic plant of claim 1, wherein the recombinant polynucleotide further comprises a 
promoter operably linked to said nucleotide sequence. 

4. The transgenic plant of claim 3, wherein said promoter is constitutive or inducible or tissue- 
active. 

5. The transgenic plant of claim 1, wherein said recombinant polynucleotide encodes a 
polypeptide comprising a conserved domain having greater than an 84% sequence identity to 
a sequence selected from the group consisting of SEQ ID Nos. 2N, where N=1-28. 

6. A method for altering the flowering time or vernalization requirement of a plant, said 
method comprising (a) transforming a plant with a recombinant polynucleotide comprising a 
nucleotide sequence encoding a polypeptide comprising at least 6 consecutive amino acids of 
a sequence selected from the group consisting of SEQ ID Nos. 2N, where N=1-28 but 
excluding SEQ ID No. 28, (b) selecting said transformed plants; and (c) identifying a 
transformed plant having an altered flowering time. 

7. The method of claim 6, wherein the nucleotide sequence encodes a polypeptide 
comprising a conserved domain selected from the group consisting of conserved domains of 
SEQ ID Nos. 2N t where N=1-28. 

8. The method of claim 6, wherein the recombinant polynucleotide further comprises a 
promoter operabiy linked to said nucleotide sequence. 
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9. The method of claim 8, wherein said promoter is constitutive or inducible or tissue-active. 

10. The method of claim 1, wherein said recombinant polynucleotide encodes a polypeptide 
5 comprising a conserved domain having greater than an 84% sequence identity to a sequence 

selected from the group consisting of SEQ ID Nos. 2N, where ISM -28. 

11. A method for altering the flowering time or vernalization requirement of a plant, said 
method comprising (a) transforming the plant with a recombinant polynucleotide comprising a 

10 nucleotide sequence-comprising at least 18 consecutive nucleotides of a sequence selected 

from the group consisting of SEQ ID Nos. 2N-1 , where N= 1-28, but excluding SEQ ID No. 27; 
and (b) selecting said transformed plant. 

12. The method of claim 11, wherein said recombinant polynucleotide encodes a polypeptide 
15 comprising a conserved domain having greater than an 84% sequence identity to a sequence 

selected from the group consisting of SEQ .ID Nos. 2N, where N=1-28. 

13. A method for altering a plant's flowering time or vernalization requirement, said method 
comprising (a) providing a database sequence; (b) comparing said database sequence with a 

20 polypeptide selected from SEQ ID Nos. 2N, where N= 1-28; (c) selecting a database 

sequence that meets selected sequence criteria; and (d) transforming said selected database 
sequence in the plant. 

14. The method of claim 13, wherein said recombinant polynucleotide encodes a polypeptide 
25 comprising a conserved domain having greater than an 84% sequence identity to a sequence 

selected from the group consisting of SEQ ID Nos. 2N, where N=1-28. 

1 5. A method for altering a plant's flowering time or vernalization requirement, said method 
comprising (a) providing a database sequence; (b) comparing said database sequence with a 

30 polynucleotide selected from SEQ ID Nos. 2N-1 , where N= 1-28; (c) selecting a database 

sequence that meets selected sequence criteria; and (d) transforming said selected database 
sequence in the plant. 

16. The method of claim 15, wherein said recombinant polynucleotide encodes a polypeptide 
35 comprising a conserved domain having greater than an 84% sequence identity to a sequence 

selected from the group consisting of SEQ ID Nos. 2N, where N=1~28. 
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MBI-0021.txt 
SEQUENCE LISTING 

<110> Ratcliffe, Oliver 
Heard, Jacqueline 
Samaha , Raymond 
Creelman, Robert 
Keddie , James 
Jiang, Cai-zhong 
Reuber , Lynne 
Riechmann, Jose Luis 

<12 0> Flowering Time Modification 

<130> MBI-0021 

<150> US 60/15.9 ,.464 
<151> 1999-10-12- 

<150> US 60/166,228 
<151> 1999-11-17 

<150> US 60/197, 899 

<151> 2000-04-17 

<150> Plant Trait Modification III 

<151> 2000-08-22 

<150> US 60/164,132 
<151> 1999-11-08 

<160> 64 

<170> Patentln version 3.0 

<210> 1 

<211> 883 

<212> DNA 

<213> Arabidopsis thaliana 
<220> 

<221> CDS 

<222> (31) . . (621) 

<223> G157 



<400> 1 

gggcataacc cttatcggag atttgaagcc atg gga aga aga aaa ate gag ate 54 

Met Gly Arg Arg Lys lie Glu lie 

1 5 

aag cga ate gag aac aaa age agt cga caa gtc act ttc tec aaa cga 102 

Lys Arg lie Glu Asn Lys Ser Ser Arg Gin Val Thr Phe Ser Lys Arg 

10 15 20 

cgc aat ggt etc ate gac aaa get cga caa ctt teg att etc tgt gaa 150 

Arg Asn Gly Leu lie Asp Lys Ala Arg Gin Leu Ser lie Leu Cys Glu 

25 30 35 40 

tec tec gtc get gtt gtc gtc gta tct gee tec gga aaa etc tat gac 198 

Ser Ser Val Ala Val Val Val Val Ser Ala Ser Gly Lys Leu Tyr Asp 

45 50 55 



tct tec tec ggt gac gac att tec aag ate att gat cgt tat gaa ata 246 
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Ser Ser Ser Gly Asp Asp lie Ser Lys He He Asp Arg Tyr Glu He 
60 65 70 

caa cat get gat gaa ctt aga gec tta gat ctt gaa gaa aaa att cag 294 
Gin His Ala Asp Glu Leu Arg Ala Leu Asp Leu Glu Glu Lys He Gin 
75 80 85 

aat tat ctt cca cac aag gag tta eta gaa aca gtc caa age aag ctt 342 
Asn Tyr Leu Pro His Lys Glu Leu Leu Glu Thr Val Gin Ser Lys Leu 
90 95 100 

gaa gaa cca aat gtc gat aat gta agt gta gat tct eta att tct ctg 390 
Glu Glu Pro Asn Val Asp Asn Val Ser Val Asp Ser Leu He Ser Leu 
105 110 115 120 

gag gaa caa ctt gag act. get ctg tec gta agt aga get agg aag gca 43 8 

Glu Glu Gin- Leu Glu Thr Ala Leu Ser Val Ser Arg Ala Arg Lys Ala 

125 130 135 

gaa ctg atg, atg gag tat ate gag tec ctt aaa gaa aag gag aaa ttg 486 
Glu Leu Met Met Glu Tyr He Glu Ser Leu Lys Glu Lys Glu Lys Leu 
140 145 150 

ctg aga gaa gag aac cag gtt ctg get age cag atg gga aag aat acg 534 
Leu Arg Glu Glu Asn Gin Val Leu Ala Ser Gin Met Gly Lys Asn Thr 
155 160 165 

ttg ctg gca aca gat gat gag aga gga atg ttt ccg gga agt age tec 582 
Leu Leu Ala Thr Asp Asp Glu Arg Gly Met Phe Pro Gly Ser Ser Ser 
170 175 180 

ggc aac aaa ata ccg gag act etc ccg ctg etc aat tag ccaccatcat 631 
Gly Asn Lys He Pro Glu Thr Leu Pro Leu Leu Asn 
185 190 195 

caaeggctga gttttcacct taaactcaaa gectgattea taattaagag aataaatttg 691 

tatattataa aaagctgtgt aatctcaaac cttttatctt cctctagtgt ggaatttaag 751 

gtcaaaaaga aaacgagaaa gtatggatca gtgttgtacc tectteggag acaagatcag 811 

agtttgtgtg tttgtgtctg aatgtacgga ttggattttt aaagttgtgc tttctttctt 871 

caaaaaaaaa aa 883 

<210> 2 
<211> 196 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 2 

Met Gly Arg Arg Lys He Glu He Lys Arg He Glu Asn Lys Ser Ser 
15 10 15 

Arg Gin Val Thr Phe Ser Lys Arg Arg Asn Gly Leu He Asp Lys Ala 
20 25 30 



Arg Gin Leu 
35 



Ser He Leu Cys Glu Ser 

40 



Ser Val Ala Val Val Val Val 
45 
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Ser Ala Ser Gly Lys Leu Tyr Asp Ser Ser Ser Gly Asp Asp lie Ser 
50 55 60 



Lys lie lie Asp Arg Tyr Glu lie Gin His Ala Asp Glu Leu Arg Ala 
65 70 75 80 



Leu Asp Leu Glu Glu Lys lie Gin Asn Tyr Leu Pro His Lys Glu Leu 

85 90 95 



Leu Glu Thr Val Gin Ser Lys Leu Glu Glu Pro Asn Val Asp Asn Val 
100 " - 105 110 



Ser Val Asp Ser Leu lie Ser Leu Glu Glu Gin Leu Glu Thr Ala Leu 
115. 120 125 



Ser Val Ser Arg Ala Arg Lys Ala Glu Leu Met Met Glu Tyr lie Glu 
130 135 140 



Ser Leu Lys Glu Lys Glu Lys Leu Leu Arg Glu Glu Asn Gin Val Leu 
145 150 155 160 



Ala Ser Gin Met Gly Lys Asn Thr Leu Leu Ala Thr Asp Asp Glu Arg 

165 170 175 



Gly Met Phe Pro Gly Ser Ser Ser Gly Asn Lys lie Pro Glu Thr Leu 
180 185 190 



Pro Leu Leu Asn 





195 


<210> 


3 


<211> 


1196 


<212> 


DNA 


<213> 


Arabidopsis 


<220> 




<221> 


CDS 


<222> 


(132) . . (569) 


<223> 


G859 


<400> 


3 



aaaaaggaga gagagagaga gagagagaga gagagagaga gaaacgaaga aaaaaaaaga 60 

agcaaaaaac attgtgggtc tccggtgatt aggatcaaat tagggcacca gccttatcgg 120 

aggaagaagc c atg ggt aga aaa aaa gtc gag ate aag cga ate gag aac 17 0 
Met Gly Arg Lys Lys Val Glu lie Lys Arg lie Glu Asn 
15 10 

aaa agt agt cga caa gtc act ttc tec aaa cga cgc aat ggt etc ate 218 
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Lys Ser Ser Arg Gin Val Thr Phe Ser Lys Arg Arg Asn Gly Leu He 
15 20 25 

gag aaa get cga caa ctt tea att etc tgt gaa tct tec ate get gtt 266 
Glu Lys Ala Arg Gin Leu Ser He Leu Cys Glu Ser Ser He Ala Val 
30 35 40 45 

etc gtc gtc tec ggc tec gga aaa etc tac aag tct gee tec ggt gac 314 
Leu Val Val Ser Gly Ser Gly Lys Leu Tyr Lys Ser Ala Ser Gly Asp 

50 55 60 

aac atg tea aag ate att gat cgt tac gaa ata cat cat get gat gaa 362 
Asn Met Ser Lys He He Asp Arg Tyr Glu He His His Ala Asp Glu 
65 70 75 

ctt gaa gee tta gat ctt gca gaa aaa act egg aat tat ctg cca etc 410 
Leu Glu Ala Leu Asp Leu Ala Glu Lys Thr Arg Asn Tyr Leu Pro Leu 
80 85 90 

aaa gag tta eta gaa ata gtc caa agg tta gca caa aga cac ttt tat 458 
Lys Glu Leu Leu Glu He Val Gin Arg Leu Ala Gin Arg His Phe Tyr 
95 100 105 

etc cct ctt ctt ctg atg aaa aat act ttt ttt ttt ctt ttc ttt tgg 506 
Leu Pro Leu Leu Leu Met Lys Asn Thr Phe Phe Phe Leu Phe Phe Trp 
110 115 120 125 

cga att atg aat aca gca age ttg aag aat caa atg teg ata atg caa 554 
Arg He Met Asn Thr Ala Ser Leu Lys Asn Gin Met Ser lie Met Gin 

130 135 140 

gtg tgg ata ctt taa tttctctgga ggaacagctc gagactgetc tgtcegtaac 60 9 

Val Trp He Leu 
145 



tagagctagg 


aagacagaac 


taatgatggg 


ggaagtgaag 


tcccttcaaa 


aaacgcatgt 


669 


caaagatcat 


tgatcgttat 


gaaatacatc 


atgctgatga 


acttaaagee 


ttagatcttg 


729 


cagaaaaaat 


teggaattat 


cttccacaca 


aggagttact 


agaaatagtc 


caaagattct 


789 


ctaatatcta 


tggaggaaca 


gctcgagact 


gctctgtcag 


taattagagc 


taagaagaca 


849 


gaactaatga 


tggaggatat 


gaagtcactt 


caagaaaggg 


agaagttget 


gatagaagag 


909 


aaccagattc 


tggctageca 


ggtggggaag 


aagacgtttc 


tggttataga 


aggtgacaga 


969 


ggaatgtcat 


gggaaaatgg 


ctccggcaac 


aaagtacggg 


agactcttcc 


getgetcaag 


1029 


taatcaccat 


catcaacggc 


tgagctttca 


ccttaaactt 


acagectgat 


tcagaagttt 


1089 


ttacaaattt 


gtaaattata 


aaaagcttca 


taataatctc 


aaccttttta 


tcttcctcgc 


1149 


gccaatgtgg 


aaattaaggt 


aaaccaaaaa 


aaaaaaaaaa 


aaaaaaa 




1196 



<210> 4 

<211> 145 

<212> PRT 

<213> Arabidopsis thaliana 

<4O0> 4 
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Met Gly Arg Lys Lys Val Glu lie Lys Arg lie Glu Asn Lys Ser Ser 
15 10 15 

Arg Gin Val Thr Phe Ser Lys Arg Arg Asn Gly Leu lie Glu Lys Ala 
20 25 30 

Arg Gin Leu Ser lie Leu Cys Glu Ser Ser lie Ala Val Leu Val Val 
35 40 45 

Ser Gly Ser Gly Lys Leu Tyr Lys Ser Ala Ser Gly Asp Asn Met Ser 
50 55 60 

Lys lie lie* Asp Arg Tyr Glu lie His His Ala Asp Glu Leu Glu Ala 
65 70 75 80 

Leu Asp Leu Ala Glu Lys Thr Arg Asn Tyr Leu Pro Leu Lys Glu Leu 

85 90 95 

Leu Glu lie Val Gin Arg Leu Ala Gin Arg His Phe Tyr Leu Pro Leu 
100 105 110 

Leu Leu Met Lys Asn Thr Phe Phe Phe Leu Phe Phe Trp Arg He Met 
115 120 125 

Asn Thr Ala Ser Leu Lys Asn Gin Met Ser He Met Gin Val Trp He 
130 135 140 

Leu 
145 

<210> 5 
<211> 972 
<212> DNA 

<213> Arabidopsis thaliana 
<220> 

<221> CDS 

<222> (132) . . (569) 

<223> G859.1 

<400> 5 

aaaaaggaga gagagagaga gagagagaga gagagagaga gaaacgaaga aaaaaaaaga 60 

agcaaaaaac attgtgggtc tccggtgatt aggatcaaat tagggcacca gccttatcgg 120 

aggaagaagc c atg ggt aga aaa aaa gtc gag ate aag cga ate gag aac 170 
Met Gly Arg Lys Lys Val Glu He Lys Arg He Glu Asn 
15 10 

aaa agt agt cga caa gtc act ttc tec aaa cga cgc aat ggt etc ate 218 
Lys Ser Ser Arg Gin Val Thr Phe Ser Lys Arg Arg Asn Gly Leu He 
15 20 25 
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gag aaa get cga caa ctt tea att etc tgt gaa tct tec ate get gtt 266 

Glu Lys Ala Arg Gin Leu Ser lie Leu Cys Glu Ser Ser lie Ala Val 
30 35 40 45 

etc gtc gtc tec ggc tec gga aaa etc tac aag tct gee tec ggt' gac 314 

Leu Val Val Ser Gly Ser Gly Lys Leu Tyr Lys Ser Ala Ser Gly Asp 

50 55 60 

aac atg tea aag ate att gat cgt tac gaa ata cat cat get gat gaa 362 

Asn Met Ser Lys lie lie Asp Arg Tyr Glu He His His Ala Asp Glu 

65 70 75 

ctt gaa gee tta gat ctt gca gaa aaa act egg aat tat ctg cca etc 410 

Leu Glu Ala Leu Asp Leu Ala Glu Lys Thr Arg Asn Tyr Leu Pro Leu 
80 85 90 

aaa gag tta eta gaa ata gtc caa agg tta gca caa aga cac ttt tat 458 

Lys Glu Leu Leu Glu He Val Gin Arg Leu Ala Gin Arg His Phe Tyr 
95 100 105 

etc cct ctt ctt ctg atg aaa aat act ttt ttt ttt ctt ttc ttt tgg 506 

Leu Pro Leu Leu Leu Met Lys Asn Thr Phe Phe Phe Leu Phe Phe Trp 
110 115 120 125 

cga att atg aat aca gca age ttg aag aat caa atg teg ata atg caa 554 

Arg He Met Asn Thr Ala Ser Leu Lys Asn Gin Met Ser He Met Gin 

130 135 140 

gtg tgg ata ctt taa tttctctgga ggaacagctc gagactgetc tgtcegtaac 609 
Val Trp He Leu 
145 



tagagctagg 


aagacagaac 


taatgatggg 


ggaagtgaag 


tcccttcaaa 


aaacggagaa 


669 


ettgetgaga 


gaagagaacc 


agactttggc 


tagecaggtg 


gggaagaaga 


cgtttctggt 


729 


tatagaaggt 


gacagaggaa 


tgtcatggga 


aaatggctcc 


ggcaacaaag 


taegggagae 


789 


tcttccgctg 


ctcaagtaat 


caccatcatc 


aacggctgag 


ctttcacctt 


aaacttacag 


849 


cctgactcag 


aagtttttac 


aaatttgtaa 


attataaaaa 


gcttcataat 


aatctcaacc 


909 


tttttatctt 


cctcgcgcca 


atgtggaaat 


taaggtaaac 


caaaaaaaaa 


aaaaaaaaaa 


969 


aaa 












972 



<210> 6 
<211> 145 
<212> PRT . 

<213> Arabidopsis thaliana 
<400> 6 

Met Gly Arg Lys Lys Val Glu He Lys Arg He Glu Asn Lys Ser Ser 
15 10 15 

Arg Gin Val Thr Phe Ser Lys Arg Arg Asn Gly Leu He Glu Lys Ala 
20 25 30 
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Arg Gin Leu Ser He Leu Cys Glu Ser Ser He Ala Val Leu Val Val 
35 40 45 

Ser Gly Ser Gly Lys Leu Tyr Lys Ser Ala Ser Gly Asp Asn Met Ser 
50 55 60 

Lys He He Asp Arg Tyr Glu He His His Ala Asp Glu Leu Glu Ala 
65 70 75 80 

Leu Asp Leu Ala Glu Lys Thr Arg Asn Tyr Leu Pro Leu Lys Glu Leu 

85 90 95 

Leu Glu lie Val Gin Arg Leu Ala Gin Arg His Phe Tyr Leu Pro Leu 
100 105 110 

Leu Leu Met Lys Asn Thr Phe Phe Phe Leu Phe Phe Trp Arg He Met 
115 120 125 

Asn Thr Ala Ser Leu Lys Asn Gin Met Ser He Met Gin Val Trp He 
130 135 140 

Leu 
145 

<210> 7 
<211> 1036 
<212> DNA 

<213> Arabidopsis thaliana 
<220> 

<221> CDS 

<222> (162).. (752) 

<223> G859.2 

<400> 7 

gatttgtcat tttttgtcta gccaaaaaaa aaaaaaaaaa aggagagaga gagagagaga 60 

gagagagaga gaaacgaaga aaaaaaaaga agcaaaaaac attgtgggtc tccggtgatt 12 0 

aggatcaaat tagggcacca gccttatcgg aggaagaagc c atg ggt aga aaa aaa 17 6 

Met Gly Arg Lys Lys 
1 5 

gtc gag ate aag cga ate gag aac aaa agt agt cga caa gtc act ttc 224 
Val Glu He Lys Arg He Glu Asn Lys Ser Ser Arg Gin Val Thr Phe 

10 15 20 

tec aaa cga cgc aat ggt etc ate gag aaa get cga caa ctt tea att 272 
Ser Lys Arg Arg Asn Gly Leu lie Glu Lys Ala Arg Gin Leu Ser He 
25 30 35 

etc tgt gaa tct tec ate get gtt etc gtc gtc tec ggc tec gga aaa 320 

Leu Cys Glu Ser Ser He Ala Val Leu Val Val Ser Gly Ser Gly Lys 
40 45 50 

Page 7 



■I 



tcaccatcat 


caacggctga 


gctttcacct 


taaacttaca 


gcctgattca 


gaagttttta 


812 


caaatttgta 


aattataaaa 


agcttcataa 


taatctcaac 


ctttttatct 


tcctcgcgcc 


872 


aatgtggaaa 


ttaaggttaa 


aaataaaata 


aaacagaagc 


tcatgcgaaa 


gaattgtaaa 


932 


actaagataa 


agctatagta 


gatctttatt 


gtaccttcgt 


agacgatata 


agatttattc 


992 


gtgtgtttgt 


cttcccctcn 


aaaaaaaaaa 


aaaaaaaaaa 


aaaa 




1036 



<210> 8 
<211> 196 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 8 

Met Gly Arg Lys Lys Val Glu lie Lys Arg lie Glu Asn Lys Ser Ser 
15 10 15 



Arg Gin Val Thr Phe Ser Lys Arg Arg Asn Gly Leu lie Glu Lys Ala 
20 25 30 
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etc tac aag tct gec tec ggt gac aac atg tea aag ate att gat cgt 368 j 

Leu Tyr Lys Ser Ala Ser Gly Asp Asn Met Ser Lys He He Asp Arg i 
55 60 65 | 

tac gaa ata cat cat get gat gaa ctt gaa gec tta gat ctt gca gaa 416 

Tyr Glu He His His Ala Asp Glu Leu Glu Ala Leu Asp Leu Ala Glu 
70 75 80 85 

aaa act egg aat tat ctg cca etc aaa gag tta eta gaa ata gtc caa 464 

Lys Thr Arg Asn Tyr Leu Pro Leu Lys Glu Leu Leu Glu He Val Gin 

90 95 100 

age aag ctt gaa gaa tea aat gtc gat aat gca agt gtg gat act tta 512 

Ser Lys Leu Glu Glu Ser Asn Val Asp Asn Ala Ser Val Asp Thr Leu 
105 110 115 

att tct ctg gag gaa cag etc gag act get ctg tec gta act aga get 560 

He Ser Leu Glu Glu Gin Leu Glu Thr Ala Leu Ser Val Thr Arg Ala 
120 125 130 

agg aag aca gaa eta atg atg ggg gaa gtg aag tec ctt caa aaa acg 608 

Arg Lys Thr Glu Leu Met Met Gly Glu Val Lys Ser Leu Gin Lys Thr 
135 140 145 

gag aac ttg ctg aga gaa gag aac cag act ttg get age cag gtg ggg 656 

Glu Asn Leu Leu Arg Glu Glu Asn Gin Thr Leu Ala Ser Gin Val Gly 
150 155 160 165 

aag aag acg ttt ctg gtt ata gaa ggt gac aga gga atg tea tgg gaa 704 

Lys Lys Thr Phe Leu Val He Glu Gly Asp Arg Gly Met Ser Trp Glu 

170 175 180 

aat ggc tec ggc aac aaa gta egg gag act ctt ccg ctg etc aag taa 752 

Asn Gly Ser Gly Asn Lys Val Arg Glu Thr Leu Pro Leu Leu Lys 
185 190 195 
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Arg Gin Leu Ser He Leu Cys Glu Ser Ser He Ala Val Leu Val Val 
35 40 45 



Ser Gly Ser Gly Lys Leu Tyr Lys Ser Ala Ser Gly Asp Asn Met.. Ser 
50 55 60 



Lys He He Asp Arg Tyr Glu He His His Ala Asp Glu Leu Glu Ala 
65 . 70 75 80 



Leu Asp Leu Ala Glu Lys Thr Arg Asn Tyr Leu Pro Leu Lys Glu Leu 

85 90 95 



Leu Glu lie Val Gin Ser Lys Leu Glu Glu Ser Asn Val Asp Asn Ala 
100 105 110 

Ser Val Asp Thr Leu He Ser Leu Glu Glu Gin Leu Glu Thr Ala Leu 
115 120 125 



Ser Val Thr Arg Ala Arg Lys Thr Glu Leu Met Met Gly Glu Val Lys 
130 135 140 

Ser Leu Gin Lys Thr Glu Asn Leu Leu Arg Glu Glu Asn Gin Thr Leu 
145 150 155 160 

Ala Ser Gin Val Gly Lys Lys Thr Phe Leu Val He Glu Gly Asp Arg 

165 170 175 

Gly Met Ser Trp Glu Asn Gly Ser Gly Asn Lys Val Arg Glu Thr Leu 
180 185 190 



Pro Leu Leu Lys 





195 


<210> 


9 


<211> 


1059 


<212> 


DNA 


<213> 


Arabidopsis thaliana 


<220> 




<221> 


CDS 


<222> 


(219) . . (809) 


<223> 


G1842 


<400> 


9 



actattacat gcctcttcct cgcttcaaaa cggcaccgtt tccacttgtt attatttttc 60 

tctctatcgt ctaacaaaaa aaaaaactga cttgggattt tttttcattt gtctagccca 120 

aaagaagaag atagaaacga agaaaaaaag caaacacatt ttgggtcccc ggtggttagg 180 

atcaaattag ggcacaaacc ttatcggaga aagaagcc atg gga aga aga aaa gtc 236 
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Met Gly Arg Arg Lys Val 
1 5 

gag ate aag cga ate gag aac aaa age agt cga caa gtc act ttc tec 284 
Glu lie Lys Arg He Glu Asn Lys Ser Ser Arg Gin Val Thr Phe Ser 
10 15 20 

aaa cga cgc aaa ggt etc ate gaa aaa get cga caa ctt tea att etc 332 
Lys Arg Arg Lys Gly Leu He Glu Lys Ala Arg Gin Leu Ser He Leu 
25 30 35 

tgt gaa tct tec ate get gtt gtc gee gtc tec ggt tec gga aaa etc 380 
Cys Glu Ser Ser He Ala Val Val Ala Val Ser Gly Ser Gly Lys Leu 
40 45 50 

tac gac tct gec tec ggt gac aac atg tea aag ate att gat cgt tat 428 
Tyr Asp" Ser Ala Ser Gly Asp Asn Met Ser Lys He He Asp Arg Tyr 
55 60 65 70 

gaa ata cat cat get gat gaa ctt aaa gee tta gat ctt gca gaa aaa 476 
Glu He His His Ala Asp Glu Leu Lys Ala Leu Asp Leu Ala Glu Lys 

75 80 85 

att egg aat tat ctt cca cac aag gag tta eta gaa ata gtc caa age 524 
He Arg .Asn Tyr Leu Pro His Lys Glu Leu Leu Glu He Val Gin Ser 
90 95 100 

aag ctt gaa gaa tea aat gtc gat aat gta agt gta gat tct eta ata 572 
Lys Leu Glu Glu Ser Asn Val Asp Asn Val Ser Val Asp Ser Leu He 
105 110 115 

tct atg gag gaa cag etc gag act get ctg tea gta att aga get aag 620 
Ser Met Glu Glu Gin Leu Glu Thr Ala Leu Ser Val He Arg Ala Lys 
120 125 130 

aag aca gaa eta atg atg gag gat atg aag tea ctt caa gaa agg gag 668 
Lys Thr Glu Leu Met Met Glu Asp Met Lys Ser Leu Gin Glu Arg Glu 
135 140 145 150 

aag ttg ctg ata gaa gag aac cag att ctg get age cag gtg ggg aag 716 
Lys Leu Leu He Glu Glu Asn Gin He Leu Ala Ser Gin Val Gly Lys 

155 160 165 

aag acg ttt ctg gtt ata gaa ggt gac aga gga atg tea egg gaa aat 764 
Lys Thr Phe Leu Val He Glu Gly Asp Arg Gly Met Ser Arg Glu Asn 
170 175 .180 

ggc tec ggc aac aaa gta ccg gag act ctt teg ctg etc aag taa 809 
Gly Ser Gly Asn Lys Val Pro Glu Thr Leu Ser Leu Leu Lys 
185 190 195 



tcaccatcat 


caaeggctga 


gctttcacca 


taaacttact 


cacagcctga 


ttcagaagct 


869 


tttacaaaat 


tgtaaattat 


aaaaagctgc 


ataataatct 


caaccttttt 


atcttcctcg 


929 


cgccaatgtg 


gaaataaagg 


taaaacaaaa 


cgaagctctt 


ttcttttatg 


cgaaagaatt 


989 


gtaaaactaa 


gataaagcta 


ccgatctttg 


ttgtacctta 


gtagacaaat 


atcagagttc 


1049 


ttgtgcttgt 












1059 



<210> 10 



Page 10 



WO 01/26459 



PCT/US00/28141 



MBI-0021. txt 

<211> 196 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 10 

Met Gly Arg Arg Lys Val Glu lie Lys Arg lie Glu Asn Lys Ser Ser 
1 5 10 15 $ 

Arg Gin Val Thr Phe Ser Lys Arg Arg Lys Gly Leu lie Glu Lys Ala 
20 25 30 



Arg Gin Leu Ser lie Leu Cys Glu Ser Ser lie Ala Val Val Ala Val 

35 - ■ - ■ 40 45 



Ser Gly Ser Gly Lys Leu Tyr Asp Ser Ala Ser Gly Asp Asn Met Ser 
50 55 60 



Lys lie lie Asp Arg Tyr Glu lie His His Ala Asp Glu Leu Lys Ala 
65 70 75 80 



Leu Asp Leu Ala Glu Lys lie Arg Asn Tyr Leu Pro His Lys Glu Leu 

85 90 95 



Leu Glu He Val Gin Ser Lys Leu Glu Glu Ser Asn Val Asp Asn Val 
100 105 110 



Ser Val Asp Ser Leu He Ser Met Glu Glu Gin Leu Glu Thr Ala Leu 
115 120 125 



Ser Val He Arg Ala Lys Lys Thr Glu Leu Met Met Glu Asp Met Lys 
130 135 140 



Ser Leu Gin Glu Arg Glu Lys Leu Leu He Glu Glu Asn Gin He Leu 
145 150 155 160 



Ala Ser Gin Val Gly Lys Lys Thr Phe Leu Val He Glu Gly Asp Arg 

165 170 175 



Gly Met Ser Arg Glu Asn Gly Ser Gly Asn Lys Val Pro Glu Thr Leu 
180 185 190 



Ser Leu Leu Lys 
195 



<210> 11 

<211> 880 

<212> DNA 

<213> Arabidopsis thaliana 
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<220> 

<221> CDS 

<222> (79) . . (636) 

<223> G1842.2 

<400> 11 

agaaaaaaag caaacacatt ttgggtcccc ggtggttagg atcaaattag ggcacaaacc 60 

ttatcggaga aagaagcc atg gga aga aga aaa gtc gag ate aag cga ate 111 

Met Gly Arg Arg Lys Val Glu lie Lys Arg lie 
15 10 

gag aac aaa age agt cga caa gtc act ttc tec aaa cga cgc aaa ggt 159 
Glu Asn Lys Ser Ser Arg Gin Val Thr Phe Ser Lys Arg Arg Lys Gly 
15 20 25 

etc ate gaa aaa get cga caa ctt tea att etc tgt gaa tct tec ate 207 
Leu lie Glu Lys Ala Arg Gin Leu Ser lie Leu Cys Glu Ser Ser He 
30 ' 35 40 

get gtt gtc gec gtc tec ggt tec gga aaa etc tac gac tct gec tec 255 
Ala Val Val Ala Val Ser Gly Ser Gly Lys Leu Tyr Asp Ser Ala Ser 
45 50 55 

ggt gac aac atg tea aag ate att gat cgt tat gaa ata cat cat get 303 
Gly Asp Asn Met Ser Lys He He Asp Arg Tyr Glu He His His Ala 
60 65 70 75 

gat gaa ctt aaa gee tta gat ctt gca gaa aaa att egg aat tat ctt 351 
Asp Glu Leu Lys Ala Leu Asp Leu Ala Glu Lys He Arg Asn Tyr Leu 

80 85 90 

cca cac aag gag tta eta gaa ata gtc caa agt gta gat tct eta ata 3 99 

Pro His Lys Glu Leu Leu Glu He Val Gin Ser Val Asp Ser Leu lie 
95 100 105 

tct atg gag gaa cag etc gag act get ctg tea gta att aga get aag 447 
Ser Met Glu Glu Gin Leu Glu Thr Ala Leu Ser Val He Arg Ala Lys 
110 115 120 

aag aca gaa eta atg atg gag gat atg aag tea ctt caa gaa agg gag 495 
Lys Thr Glu Leu Met Met Glu Asp Met Lys Ser Leu Gin Glu Arg Glu 
125 130 135 

aag ttg ctg ata gaa gag aac cag att ctg get age cag gtg ggg aag 543 
Lys Leu Leu He Glu Glu Asn Gin lie Leu Ala Ser Gin Val Gly Lys 
140 145 150 155 

aag acg ttt ctg gtt ata gaa ggt gac aga gga atg tea egg gaa aat 591 
Lys Thr Phe Leu Val lie Glu Gly Asp Arg Gly Met Ser Arg Glu Asn 

160 165 170 

ggc tec ggc aac aaa gta ccg gag act ctt teg ctg etc aag taa 636 . 

Gly Ser Gly Asn Lys Val Pro Glu Thr Leu Ser Leu Leu Lys 
175 180 185 

tcaccatcat caaeggctga gctttcacca taaacttact cacagcctga ttcagaagct 696 

tttacaaaat tgtaaattat aaaaagctgc ataataatct caaccttttt atcttcctcg 756 

cgccaatgtg gaaataaagg taaaacaaaa cgaagctctt ttcttttatg cgaaagaatt 816 
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gtaaaactaa gataaagcta ccgatctttg ttgtacctta gtagacaaat atcagagttc 876 
ttgt 880 

<210> 12 
<211> 185 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 12 

Met Gly Arg Arg Lys Val Glu lie Lys Arg lie Glu Asn Lys Ser Ser 
15 10 15 

Arg Gin Val Thr Phe Ser Lys Arg Arg Lys Gly Leu lie Glu Lys Ala 
20 25 30 

Arg Gin Leu Ser He Leu Cys Glu Ser Ser He Ala Val Val Ala Val 
35 40 45 

Ser Gly Ser Gly Lys Leu Tyr Asp Ser Ala Ser Gly Asp Asn Met Ser 
50 55 60 

Lys He He Asp Arg Tyr Glu He His His Ala Asp Glu Leu Lys Ala 
65 70 75 80 

Leu Asp Leu Ala Glu Lys He Arg Asn Tyr Leu Pro His Lys Glu Leu 

85 90 95 

Leu Glu He Val Gin Ser Val Asp Ser Leu lie Ser Met Glu Glu Gin 
100 105 110 

Leu Glu Thr Ala Leu Ser Val He Arg Ala Lys Lys Thr Glu Leu Met 
115 120 125 

Met Glu Asp Met Lys Ser Leu Gin Glu Arg Glu Lys Leu Leu He Glu 
130 135 140 

Glu Asn Gin He Leu Ala Ser Gin Val Gly Lys Lys Thr Phe Leu Val 
145 150 155 160 

He Glu Gly Asp Arg Gly Met Ser Arg Glu Asn Gly Ser Gly Asn Lys 

165 170 175 

Val Pro Glu Thr Leu Ser Leu Leu Lys 
180 185 



<210> 13 
<211> 978 
<212> DNA 
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<213> Arabidopsis thaliana 
<220> 

<221> CDS 

<222> (219) - . (452) 

<223> G1842.6 



<400> 13 



actattacat 


gcctcttcct 


cgcttcaaaa 


cggcaccgtt tccacttgtt attatttttc 


60 


tctctatcgt 


ctaacaaaaa 


aaaaaactga 


cttgggattt tttttcattt gtctagccca 


120 


aaagaagaag 


atagaaacga 


agaaaaaaag 


caaacacatt ttgggtcccc ggtggttagg 


180 


atcaaattag 


ggcacaaacc 


. ttatcggaga 


aagaagcc atg gga aga aga aaa gtc 
Met Gly Arg Arg Lys Val 


236 



1 5 

gag ate aag cga ate gag aac aaa age agt cga caa gtc act ttc tec 284 
Glu lie Lys Arg lie Glu Asn Lys Ser Ser Arg Gin Val Thr Phe Ser 
10 15 20 

aaa cga cgc aaa ggt etc ate gaa aaa get cga caa ctt tea att etc 332 
Lys Arg Arg Lys Gly Leu lie Glu Lys Ala Arg Gin Leu Ser lie Leu 
25 30 35 

tgt gaa tct tec ate get gtt gtc gee gtc tec ggt tec gga aaa etc 380 
Cys Glu Ser Ser lie Ala Val Val Ala Val Ser Gly Ser Gly Lys Leu 
40 45 50 

tac gac tct gee tec ggt gac aag ate ttg cag aaa aaa ttc gga att 428 
Tyr Asp Ser Ala Ser Gly Asp Lys lie Leu Gin Lys Lys Phe Gly lie 
55 60 65 70 

ate ttc cac aca agg agt tac tag aaatagtcca aagattctct aatatctatg ' 482 
lie Phe His Thr Arg Ser Tyr 

75 



gaggaacagc 


tegagactge 


tctgtcagta 


attagagcta 


agaagacaga 


actaatgatg 


542 


gaggatatga 


agtcacttca 


agaaagggag 


aagttgctga 


tagaagagaa 


ccagattctg 


602 


getagecagg 


tggggaagaa 


gacgtttctg 


gttatagaag 


gtgacagagg 


aatgtcacgg 


662 


gaaaatggct 


ccggcaacaa 


agtaceggag 


actctttcgc 


tgctcaagta 


atcaccatca 


722 


teaaeggctg 


agctttcacc 


ataaacttac 


tcacagcctg 


attcagaagc 


ttttacaaaa 


782 


ttgtaaatta 


taaaaagctg 


cataataatc 


tcaacctttt 


tatcttcctc 


gcgccaatgt 


842 


ggaaataaag 


gtaaaacaaa 


acgaagctct 


tttcttttat 


gcgaaagaat 


tgtaaaacta 


902 


agataaagct 


accgatcttt 


gttgtacctt 


agtagacaaa 


tatcagagtt 


cttgtgcttg 


962 


aaaaaaaaaa 


aaaaaa 










978 



<210> 14 

<211> 77 

<212> PRT 

<213> Arabidopsis thaliana 
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<400> 14 

Met Gly Arg Arg Lys Val Glu lie Lys Arg He Glu Asn Lys Ser Ser 
15 10 15 



Arg Gin Val Thr Phe Ser Lys Arg Arg Lys Gly Leu He Glu Lys Ala 
20 25 30 



Arg Gin Leu Ser He Leu Cys Glu Ser Ser He Ala Val Val Ala Val 
35 40 45 



Ser Gly Ser Gly Lys Leu Tyr Asp Ser Ala Ser Gly Asp Lys He Leu 

50 . .. 55 60 



Gin Lys Lys Phe Gly He He Phe His Thr Arg Ser Tyr 
65 70 75 



<210> 


15 


<211> 


876 


<212> 


DNA 


<213> 


Arabidopsis 


<220> 




<221> 


CDS 


<222> 


(80) . . (436) 


<223> 


G1842.7 


<400> 


15 



cagaaaaaaa gcaaacacat tttgggtccc cggtggttag gatcaaatta gggcacaaac 60 

cttatcggag aaagaagcc atg gga aga aga aaa gtc gag ate aag cga ate 112 

Met Gly Arg Arg Lys Val Glu He Lys Arg He 
1 5 10 

gag aac aaa age agt cga caa gtc act ttc tec aaa cga cgc aaa ggt 160 
Glu Asn Lys Ser Ser Arg Gin Val Thr Phe Ser Lys Arg Arg Lys Gly 
15 20 25 

etc ate gaa aaa get cga caa ctt tea att etc tgt gaa tct tec ate 208 
Leu He Glu Lys Ala Arg Gin Leu Ser He Leu Cys Glu Ser Ser He 
30 35 40 

get gtt gtc gee gtc tec ggt tec gga aaa etc tac gac tct gee tec 256 
Ala Val Val Ala Val Ser Gly Ser Gly Lys Leu Tyr Asp Ser Ala Ser 
45 50 55 

ggt gac aac atg tea aag ate att gat cgt tat gaa ata cat cat get 304 
Gly Asp Asn Met Ser Lys He He Asp Arg Tyr Glu lie His His Ala 
60 65 70 75 

gat gaa ctt aaa gee tta gat ctt gca gaa aaa att egg aat tat ctt 352 
Asp Glu Leu Lys Ala Leu Asp Leu Ala Glu Lys He Arg Asn Tyr Leu 

80 85 90 

cca cac aag gag tta eta gaa ata gtc caa aga ttc tct aat ate tat 400 
Pro His Lys Glu Leu Leu Glu lie Val Gin Arg Phe Ser Asn lie Tyr 
95 100 105 
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gga gga aca get cga gac tgc tct gtc agt aat tag agctaagaag 1 446 

Gly Gly Thr Ala Arg Asp Cys Ser Val Ser Asn 
110 115 

acagaactaa tgatggagga tatgaagtca cttcaagaaa gggagaagtt gctgatagaa 506 

gagaaccaga ttctggctag ccaggtgggg aagaagacgt ttctggttat agaaggtgac 566 

agaggaatgt caegggaaaa tggctccggc aacaaagtac eggagactet ttcgctgctc 626 

aagtaatcac catcatcaac ggctgagctt tcaccataaa cttactcaca gectgattea 686 

gaagctttta caaaattgta aattataaaa agetgeataa taatctcaac ctttttatct 746 

tcctcgcgcc aatgtggaaa taaaggtaaa acaaaacgaa gctcttttct tttatgcgaa 806 

agaattgtaa aactaagata aagctaccga tctttgttgt accttagtag acaaatatca 866 

gagttcttgt ■ 876 

<210> 16 
<211> 118 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 16 

•Met Gly Arg Arg Lys Val Glu lie Lys Arg lie Glu Asn Lys Ser Ser 
15 10 15 

Arg Gin Val Thr Phe Ser Lys Arg Arg Lys Gly Leu lie Glu Lys Ala 
20 25 30 

Arg Gin Leu Ser lie Leu Cys Glu Ser Ser lie Ala Val Val Ala Val 
35 40 45 

Ser Gly Ser Gly Lys Leu Tyr Asp Ser Ala Ser Gly Asp Asn Met Ser 
50 55 60 

Lys lie lie Asp Arg Tyr Glu lie His His Ala Asp Glu Leu Lys Ala 
65 70 75 80 

Leu Asp Leu Ala Glu Lys lie Arg Asn Tyr Leu Pro His Lys Glu Leu 

85 90 95 

Leu Glu lie Val Gin Arg Phe Ser Asn lie Tyr Gly Gly Thr Ala Arg 
100 105 110 

Asp Cys Ser Val Ser Asn 
115 



<210> 17 
<211> 818 
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<212> DNA 

<213> Arabidopsis thaliana 
<220> 

<221> CDS 

<222> (51) . . (653) 

<223> G1843 

<400> 17 

cagacatcac aatcaaatta ggtcagaaga attagtcgga gaaaacagcc atg gga 56 

Met Gly 
1 

aga aga aaa gta gag ate aaa cga att gag aac aaa age tct cga caa 104 
Arg Arg Lys Val Glu lie. Lys Arg lie Glu Asn Lys Ser Ser Arg Gin 
5 " 10 15 

gtt act ttc tgt aaa cga cga aat ggt etc atg gag aaa get cgt caa 152 
Val Thr Phe' Cys Lys Arg Arg Asn Gly Leu Met Glu Lys Ala Arg Gin 
20 25 30 

etc tea att ctt tgt gaa tec tec gtc get ctt ate ate ate tct gee 200 
Leu Ser lie Leu Cys Glu Ser Ser Val Ala Leu lie lie lie Ser Ala 
35 40 45 50 

ace gga aga etc tac age ttc tec tea ggt gat age atg gee aag ate 248 
Thr Gly Arg Leu Tyr Ser Phe Ser Ser Gly Asp Ser Met Ala Lys lie 

55 60 65 

etc agt cgt tat gaa tta gaa cag get gat gat ctt aaa ace ttg gat 296 
Leu Ser Arg Tyr Glu Leu Glu Gin Ala Asp Asp Leu Lys Thr Leu Asp 
70 75 80 

eta gaa gaa aaa act ctt aat tat ctt teg cac aag gag ttg eta gaa 344 
Leu Glu Glu Lys Thr Leu Asn Tyr Leu Ser His Lys Glu Leu Leu Glu 
85 90 95 

aca ate caa tgc aag att gaa gaa gcg aaa age gat aat gta agt ata 392 
Thr lie Gin Cys Lys lie Glu Glu Ala Lys Ser Asp Asn Val Ser lie 
100 105 110 

gat tgt eta aag tec ctg gaa gag cag etc aag act get ctg tct gta 440 
Asp Cys Leu Lys Ser Leu Glu Glu Gin Leu Lys Thr Ala Leu Ser Val 
115 120 125 130 

act aga get agg aag aca gaa eta atg atg gag ctt gtg aag ace cat 488 
Thr Arg Ala Arg Lys Thr Glu Leu Met Met Glu Leu Val Lys Thr His 

135 140 145 

caa gag aag gag aag ctg ctg aga gag gag aac cag agt ttg act aac 53 6 

Gin Glu Lys Glu Lys Leu Leu Arg Glu Glu Asn Gin Ser Leu Thr Asn 
150 155 160 

cag ctt ata aag atg ggg aag atg aag aag tct gtg gaa gca gag gat 5 84 

Gin Leu lie Lys Met Gly Lys Met Lys Lys Ser Val Glu Ala Glu Asp 
165 170 175 

gca aga gca atg tea ccg gaa agt age tct gac aac aag cca ccg gag 632 
Ala Arg Ala Met Ser Pro Glu Ser Ser Ser Asp Asn Lys Pro Pro Glu 
180 185 190 

act etc ctg ctt etc aag taa ccaccatcac caacgactga ttcgaaaaat 683 
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Thr Leu Leu Leu Leu Lys 
195 200 

aaaaattgta aaaattatga tttgtagttc ataaggaaag ctacatactg tatgttaaaa 743 

atcctcttct tccccctgct acggaaaagt catccaagga gatgcatcaa ataaagtaat — 803 

tgatttttat tgtta 818 

<210> 18 
<211> 200 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 18 - , 

Met Gly Arg Arg Lys Val Glu lie Lys Arg He Glu Asn Lys Ser Ser 
15 10 15 

Arg Gin Val Thr Phe Cys Lys Arg Arg Asn Gly Leu Met Glu Lys Ala 
20 25 30 

Arg Gin Leu Ser He Leu Cys Glu Ser Ser Val Ala Leu He He He 
35 40 45 

Ser Ala Thr Gly Arg Leu Tyr Ser Phe Ser Ser Gly Asp Ser Met Ala 
50 55 60 

Lys He Leu Ser Arg Tyr Glu Leu Glu Gin Ala Asp Asp Leu Lys Thr 
65 70 75 80 

Leu Asp Leu Glu Glu Lys Thr Leu Asn Tyr Leu Ser His Lys Glu Leu 

85 90 95 

Leu Glu Thr He Gin Cys Lys He Glu Glu Ala Lys Ser Asp Asn Val 
100 105 110 

Ser He Asp Cys Leu Lys Ser Leu Glu Glu Gin Leu Lys Thr Ala Leu 
115 120 125 

Ser Val Thr Arg Ala Arg Lys Thr Glu Leu Met Met Glu Leu Val Lys 
130 135 140 

Thr His Gin Glu Lys Glu Lys Leu Leu Arg Glu Glu Asn Gin Ser Leu 
145 150 155 160 

Thr Asn Gin Leu He Lys Met Gly Lys Met Lys Lys Ser Val Glu Ala 

165 170 175 

Glu Asp Ala Arg Ala Met Ser Pro Glu Ser Ser Ser Asp Asn Lys Pro 
180 185 190 
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Pro Glu Thr Leu Leu Leu Leu Lys 

200 





195 


<210> 


19 


<211> 


834 


<212> 


DNA 


<213> 


Arabidopsis 


<220> 




<221> 


CDS 


<222> 


(39) . . (635) 


<223> 


G1844 


<400> 


19 



agaaattagg ggattagatg tgtcggaaga gtgaagcc atg gga aga aga aga gta 56 

Met Gly Arg Arg Arg Val 
1 5 

gag ate aaa cga att gag aac aaa age agt aga caa gtc act ttc tgt 104 
Glu lie Lys Arg lie Glu Asn Lys Ser Ser Arg Gin Val Thr Phe Cys 
10 15 20 

aag aga cga aat ggt etc atg gag aaa get cgt caa etc tea att etc 152 
Lys Arg Arg Asn Gly Leu Met Glu Lys Ala Arg Gin Leu Ser lie Leu 
25 30 35 

tgt gga tec tec gtc get ctt ttc ate gtc tct tec ace ggc aaa etc 200 
Cys Gly Ser Ser Val Ala Leu Phe lie Val Ser Ser Thr Gly Lys Leu 
40 45 50 

tac aac tec tec tec ggc gac age atg gee aag ate ate agt cgt ttt 248 
Tyr Asn Ser Ser Ser Gly Asp Ser Met Ala Lys lie lie Ser Arg Phe 
55 60 65 70 

aaa ata caa caa get gat gat cct gaa acc ttg gat ctt gaa gac aaa 296 
Lys lie Gin Gin Ala Asp Asp Pro Glu Thr Leu Asp Leu Glu Asp Lys 

75 80 85 

act cag gat tat ctt tea cac aag gag tta eta gaa ata gtt caa aga 344 
Thr Gin Asp Tyr Leu Ser His Lys Glu Leu Leu Glu lie Val Gin Arg 
90 95 100 

aag att gaa gaa gca aaa ggg gat aat gta agt ata gaa tct eta att 3 92 

Lys lie Glu Glu Ala Lys Gly Asp Asn Val Ser lie Glu Ser Leu lie 
105 110 115 

tec atg gaa gag cag etc aag agt get ctg tct gta att aga get agg 440 
Ser Met Glu Glu Gin Leu Lys Ser Ala Leu Ser Val lie Arg Ala Arg 
120 125 130 

aag aca gag tta ttg atg gag ctt gtg aag aac ctt cag gat aag gag 4 88 

Lys Thr Glu Leu Leu Met Glu Leu Val Lys Asn Leu Gin Asp Lys Glu 
135 140 145 150 

aag ttg ctg aaa gaa aag aac aag gtt eta get age gag gtg ggg aag 536 
Lys Leu Leu Lys Glu Lys Asn Lys Val Leu Ala Ser Glu Val Gly Lys 

155 160 165 

ctg aag aaa att ttg gaa aca ggg gat gaa aga gca gta atg tea ccg 584 
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Leu Lys Lys He Leu Glu Thr Gly Asp Glu Arg Ala Val Met Ser Pro 
170 175 180 

gaa aat age tct ggc cac age cca ccg gag act etc ccg ctt etc aag 632 
Glu Asn Ser Ser Gly His Ser Pro Pro Glu Thr Leu Pro Leu Leu Lys 
185 190 195 

taa ccaccaatca teaaeggctg atttttcatc atcctgattc aaaaaaggta 685 

aaaaaaattc atgtgtaaaa atcataaaga agctacatgt tttaaaatcc tcttctcccc 745 

ctgeataegg ataaatttat agaccaaaaa tataatgttt tccctcaaat aagatatcga 805 

cctttgtgtt accttggaag acaggatca 834 

<210> 20 
<211> 198 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 20 

Met Gly Arg Arg Arg Val Glu He Lys Arg He Glu Asn Lys Ser Ser 
15 10 15 

Arg Gin Val Thr Phe Cys Lys Arg Arg Asn Gly Leu Met Glu Lys Ala 
20 25 30 

Arg Gin Leu Ser He Leu Cys Gly Ser Ser Val Ala Leu Phe He Val 
35 40 45 

Ser Ser Thr Gly Lys Leu Tyr Asn Ser Ser Ser Gly Asp Ser Met Ala 
50 55 60 

Lys lie He Ser Arg Phe Lys lie Gin Gin Ala Asp Asp Pro Glu Thr 
65 70 75 80 

Leu Asp Leu Glu Asp Lys Thr Gin Asp Tyr Leu Ser His Lys Glu Leu 

85 90 95 

Leu Glu lie Val Gin Arg Lys lie Glu Glu Ala Lys Gly Asp Asn Val 
100 105 110 

Ser He Glu Ser Leu lie Ser Met Glu Glu Gin Leu Lys Ser Ala Leu 
115 120 125 

Ser Val lie Arg Ala Arg Lys Thr Glu Leu Leu Met Glu Leu Val Lys 
130 135 140 

Asn Leu Gin Asp Lys Glu Lys Leu Leu Lys Glu Lys Asn Lys Val Leu 
145 150 155 160 
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Ala Ser Glu Val Gly Lys Leu Lys Lys lie Leu Glu Thr Gly Asp Glu 

165 170 175 



Arg Ala Val Met Ser Pro Glu Asn Ser Ser Gly His Ser Pro Pro. Glu 
180 185 190 



Thr Leu Pro Leu Leu Lys 
195 



<210> 


21 


<211> 


753 


<212> 


DNA 


<213> 


Arabidopsis 


<220> 




<221> 


CDS 


<222> 


(1) . . (555) 


<223> 


G1844.2 


<400> 


21 



atg gga aga aga aga gta gag ate aaa cga att gag aac aaa age agt 4 8 

Met Gly Arg Arg Arg Val Glu lie Lys Arg lie Glu Asn Lys Ser Ser 
1 5 10 15 

aga caa gtc act ttc tgt aag aga cga aat ggt etc atg gag aaa get 96 
Arg Gin Val Thr Phe Cys Lys Arg Arg Asn Gly Leu Met Glu Lys Ala 
20 25 30 

cgt caa etc tea att etc tgt gga tec tec gtc get ctt ttc ate gtc 144 
Arg Gin Leu Ser lie Leu Cys Gly Ser Ser Val Ala Leu Phe lie Val 
35 40 45 

tct tec ace ggc aaa etc tac aac tec tec tec ggc gac age atg gee 192 
Ser Ser Thr Gly Lys Leu Tyr Asn Ser Ser Ser Gly Asp Ser Met Ala 
50 55 60 

aag ate ate agt cgt ttt aaa ata caa caa get gat gat cct gaa acc 240 
Lys lie lie Ser Arg Phe Lys lie Gin Gin Ala Asp Asp Pro Glu Thr 
65 70 75 80 

ttg gat ctt gaa gac aaa act cag gat tat ctt tea cac aag gag tta 288 
Leu Asp Leu Glu Asp Lys Thr Gin Asp Tyr Leu Ser His Lys Glu Leu 

85 90 95 

eta gaa ata gtt caa aga aag att gaa gaa gca aaa ggg gat aat gta 336 
Leu Glu lie Val Gin Arg Lys lie Glu Glu Ala Lys Gly Asp Asn Val 
100 105 110 

agt ata gaa tct eta att tec atg gaa gag cag etc aag agt get ctg 384 
Ser lie Glu Ser Leu lie Ser Met Glu Glu Gin Leu Lys Ser Ala Leu 
115 120 125 

tct gta att aga get agg aag aca gag tta ttg atg gag ctt gtg aag 432 
Ser Val lie Arg Ala Arg Lys Thr Glu Leu Leu Met Glu Leu Val Lys 
130 135 140 

aac ctt cag gat aag gtg ggg aag ctg aag aaa att ttg gaa aca ggg 480 
Asn Leu Gin Asp Lys Val Gly Lys Leu Lys Lys lie Leu Glu Thr Gly 
145 150 155 160 
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gat gaa aga gca gta atg tea ccg gaa aat age tct ggc cac age cca 528 

Asp Glu Arg Ala Val Met Ser Pro Glu Asn Ser Ser Gly His Ser Pro 

165 170 175 

ccg gag act etc ccg ctt etc aag taa ccaccaatca teaaeggctg 575 
Pro Glu Thr Leu Pro Leu Leu Lys 
180 

atttttcatc atcctgattc aaaaaaggta aaaaaaattc atgtgtaaaa atcataaaga 635 

agctacatgt tttaaaatcc tcttctcccc ctgeataegg ataaatttat agaccaaaaa 695 

tataatgttt tccctcaaat aagatatcga cctttgtgtt accttggaag acaggatc 753 

<210> 22 
<211> 184 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 22 

Met Gly Arg Arg Arg Val Glu lie Lys Arg lie Glu Asn Lys Ser Ser 
15 10 15 

Arg Gin Val Thr Phe Cys Lys Arg Arg Asn Gly Leu Met Glu Lys Ala 
20 25 30 

Arg Gin Leu Ser lie Leu Cys Gly Ser Ser Val Ala Leu Phe lie Val 
35 40 45 

Ser Ser Thr Gly Lys Leu Tyr "Asn Ser Ser Ser Gly Asp Ser Met Ala 
50 55 60 

Lys lie lie Ser Arg Phe Lys He Gin Gin Ala Asp Asp Pro Glu Thr 
65 70 75 80 

Leu Asp Leu Glu Asp Lys Thr Gin Asp Tyr Leu Ser His Lys Glu Leu 

85 90 95 

Leu Glu He Val Gin Arg Lys He Glu Glu Ala Lys Gly Asp Asn Val 
100 105 110 

Ser He Glu Ser Leu He Ser Met Glu Glu Gin Leu Lys Ser Ala Leu 
115 120 125 

Ser Val He Arg Ala Arg Lys Thr Glu Leu Leu Met Glu Leu Val Lys 
130 135 140 

Asn Leu Gin Asp Lys Val Gly Lys Leu Lys Lys He Leu Glu Thr Gly 
145 150 155 160 
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Asp Glu Arg Ala Val Met Ser Pro Glu Asn Ser Ser Gly His Ser Pro 

165 170 175 



Pro Glu Thr Leu Pro Leu Leu Lys 
180 



<210> 


23 


<211> 


1134 


<212> 


DNA 


<213> 


Arabidopsis 


<220> 




<221> 


CDS 


<222> 


(158) . . (880) 


<223> 


G861 


<400> 


23 ■ 



cttcttcctc ctcctccatc tcttctcttt actctctctt taatcatctc tcattcttga 60 

atcttgatcc atcaaaatca atcccgttct cgaaagatcc attaaaatca aaacctaagc 120 

tctctctctt gcttctaggg tttttttgtt cgttgtg atg gcg aga gaa aag att 175 

Met Ala Arg Glu Lys lie 
1 5 

cag ate agg aag ate gac aac gca acg gcg aga caa gtg acg ttt teg 223 
Gin lie Arg Lys lie Asp Asn Ala Thr Ala Arg Gin Val Thr Phe Ser 
10 15 20 

aaa cga aga aga ggg ctt ttc aag aaa get gaa gaa etc tec gtt etc 271 
Lys Arg Arg Arg Gly Leu Phe Lys Lys Ala Glu Glu Leu Ser Val Leu 
25 30 35 

tgc gac gee gat gtc get etc ate ate ttc tct tec acc gga aaa ctg 319 
Cys Asp Ala Asp Val Ala Leu lie lie Phe Ser Ser Thr Gly Lys Leu 
40 45 50 

ttc gag ttc tgt age tec age atg aag gaa gtc eta gag agg cat aac 3 67 

Phe Glu Phe Cys Ser Ser Ser Met Lys Glu Val Leu Glu Arg His Asn 
55 60 65 70 

ttg cag tea aag aac ttg gag aag ctt gat cag cca tct ctt gag tta 415 
Leu Gin Ser Lys Asn Leu Glu Lys Leu Asp Gin Pro Ser Leu Glu Leu 

75 80 85 

cag ctg gtt gag aac agt gat cac gee cga atg agt aaa gaa att gcg 463 
Gin Leu Val Glu Asn Ser Asp His Ala Arg Met Ser Lys Glu lie Ala 
90 95 100 

gac aag age cac cga eta agg caa atg aga gga gag gaa ctt caa gga 511 
Asp Lys Ser His Arg Leu Arg Gin Met Arg Gly Glu Glu Leu Gin Gly 
105 110 115 

ctt gac att gaa gag ctt cag cag eta gag aag gee ctt gaa act ggt 559 
Leu Asp lie Glu Glu Leu Gin Gin Leu Glu Lys Ala Leu Glu Thr Gly 
120 125 130 

ttg acg cgt gtg att gaa aca aag agt gac aag att atg agt gag ate 607 
Leu Thr Arg Val lie Glu Thr Lys Ser Asp Lys lie Met Ser Glu lie 
135 140 145 150 
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age gaa ctt cag aaa aag gga atg caa ttg atg gat gag aac aag egg 655 

Ser Glu Leu Gin Lys Lys Gly Met Gin Leu Met Asp Glu Asn Lys Arg 

155 160 * 165 

ttg agg cag caa gga acg caa eta acg gaa gag aac gag cga ctt ggc 703 

Leu Arg Gin Gin Gly Thr Gin Leu Thr Glu Glu Asn Glu Arg Leu Gly 

170 175 180 

atg caa ata tgt aac aat gtg cat gca cac ggt ggt get gaa teg gag 751 

Met Gin lie Cys Asn Asn Val His Ala His Gly Gly Ala Glu Ser Glu 

185 190 195 

aac get get gtg tac gag gaa gga cag teg teg gag tct att act aac 799 

Asn Ala Ala Val Tyr Glu Glu Gly Gin Ser Ser Glu Ser lie Thr Asn 

200 -205 210 

gec gga aac tct acc gga gcg cct gtt gac tec gag age tec gac act 847 

Ala Gly Asn Ser Thr Gly Ala Pro Val Asp Ser Glu Ser Ser Asp Thr 

215 220 225 230 

tec ctt agg etc ggc tta ccg tat ggt ggt tag agatggaaca attcaaagaa 900 

Ser Leu Arg Leu Gly Leu Pro Tyr Gly Gly 





235 




240 








gttgatggag 


tgaggagagt 


aatgtaaatc 


tttttaactc 


ggtagtaaca 


agagacaatg 


960 


tctaagtagt 


gaattctcaa 


atgtttgtgt 


aagtttctgc 


ctatggaaga 


ggctttcatt 


1020 


tttatgattt 


tcactatgta 


tgatctctct 


teactgeatt 


tctggttagt 


aacggcttgt 


1080 


caccgataaa 


etttctegtt 


atggaaagtt 


agaataaaaa 


aaaaaaaaaa 


aaaa 


1134 



<210> 24 
<211> 240 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 24 

Met Ala Arg Glu Lys lie Gin lie Arg Lys lie Asp Asn Ala Thr Ala 
15 10 15 



Arg Gin Val Thr Phe Ser Lys Arg Arg Arg Gly Leu Phe Lys Lys Ala 
20 25 30 



Glu Glu Leu Ser Val Leu Cys Asp Ala Asp Val Ala Leu lie lie Phe 
35 40 45 



Ser Ser Thr Gly Lys Leu Phe Glu Phe Cys Ser Ser Ser Met Lys Glu 
50 55 60 



Val Leu Glu Arg His Asn Leu Gin Ser Lys Asn Leu Glu Lys Leu Asp 
65 70 75 80 



Gin Pro Ser Leu Glu Leu Gin Leu Val Glu Asn Ser Asp His Ala Arg 

85 90 95 
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Met Ser Lys Glu lie Ala Asp Lys Ser His Arg Leu Arg Gin Met Arg 
100 105 110 



Gly Glu Glu Leu Gin Gly Leu Asp lie Glu Glu Leu Gin Gin Leu Glu 
115 120 125 



Lys Ala Leu Glu Thr Gly Leu Thr Arg Val lie Glu Thr Lys Ser Asp 
130 135 140 



Lys lie Met Ser Glu lie Ser Glu Leu Gin Lys Lys Gly Met Gin Leu 
145 , 150 - - 155 160 



Met Asp Glu Asn Lys Arg Leu Arg Gin Gin Gly Thr Gin Leu Thr Glu 

165 170 175 



Glu Asn Glu Arg Leu Gly Met Gin lie Cys Asn Asn Val His Ala His 
180 185 190 



Gly Gly Ala Glu Ser Glu Asn Ala Ala Val Tyr Glu Glu Gly Gin Ser 
195 200 205 



Ser Glu Ser He Thr Asn Ala Gly Asn Ser Thr Gly Ala Pro Val Asp 
210 215 220 



Ser Glu Ser Ser Asp Thr Ser Leu Arg Leu Gly Leu Pro Tyr Gly Gly 
225 230 235 240 



<210> 


25 


<211> 


1552 


<212> 


DNA 


<213> 


Arabidopsis thai i ana 


<220> 




<221> 


CDS 


<222> 


(193) . . (825) 


<223> 


G861.1 


<400> 


25 



attctctctc tcncaaaatt tatttcctct ggcttcttct tcctcctcct ccatctcttc 60 

tctttactct ctctttaatc atctctcatt cttgaatctt gatccatcaa aatcaatccc 120 

gttctcgaaa gatccattaa aatcaaaacc taagctctct ctcttgcttc tagggttttt 180 

ttgttcgttg tg atg gcg aga gaa aag att cag ate agg aag ate gac aac 231 
Met Ala Arg Glu Lys He Gin He Arg Lys He Asp Asn 
15 10 

gca acg gcg aga caa gtg acg ttt teg aaa cga aga aga ggg ctt ttc 27 9 

Ala Thr Ala Arg Gin Val Thr Phe Ser Lys Arg Arg Arg Gly Leu Phe 
15 20 25 
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aag aaa get gaa gaa etc tec gtt etc tgc gac gee gat gtc get etc 327 

Lys Lys Ala Glu Glu Leu Ser Val Leu Cys Asp Ala Asp Val Ala Leu 
30 35 40 45 

ate ate ttc tct tec acc gga aaa ctg ttc gag ttc tgt age tec age 375 

lie lie Phe Ser Ser Thr Gly Lys Leu Phe Glu Phe Cys Ser Ser Ser 

50 55 60 

atg aag gaa gtc eta gag agg cat aac ttg cag tea aag aac ttg. gag 423 

Met Lys Glu Val Leu Glu Arg His Asn Leu Gin Ser Lys Asn Leu Glu 
65 70 75 

aag ctt gat cag cca tct ctt gag tta cag ctg gtt gag aac agt gat 471 

Lys Leu Asp Gin Pro Ser Leu Glu Leu Gin Leu Val Glu Asn Ser Asp 
80 85 90 

cac gee cga atg agt aaa gaa att gcg gac aag age cac cga eta agg 519 

His Ala Arg Met Ser Lys Glu lie Ala Asp Lys Ser His Arg Leu Arg 
95 100 105 

caa atg aga gga gag gaa ctt caa gga ctt gac att gaa gag ctt cag 567 

Gin Met Arg Gly Glu Glu Leu Gin Gly Leu Asp lie Glu Glu Leu Gin 
110 115 120 125 

cag eta gag aag gee ctt gaa act ggt ttg acg cgt gtg att gaa aca 615 

Gin Leu Glu Lys Ala Leu Glu Thr Gly Leu Thr Arg Val lie Glu Thr 

130 135 140 

aag agt gac aag att atg agt gag ate age gaa ctt cag aaa aag gga 663 

Lys Ser Asp Lys lie Met Ser Glu lie Ser Glu Leu Gin Lys Lys Gly 
145 150 155 

atg caa ttg atg gat gag aac aag egg ttg agg cag caa gta tgt gtc 711 

Met Gin Leu Met Asp Glu Asn Lys Arg Leu Arg Gin Gin Val Cys Val 
160 165 170 

tta ccc tct ctg ttg ata aca aat ccc ttt ctt ttg tct acc att aac 759 

Leu Pro Ser Leu Leu lie Thr Asn Pro Phe Leu Leu Ser Thr He Asn 
175 180 185 

gta cac act cct aaa ttt aat ccc cag ttg tct aca aca cat atg ttt 807 

Val His Thr Pro Lys Phe Asn Pro Gin Leu Ser Thr Thr His Met Phe 
190 195 200 205 

gat cat act gtg. aga taa atgaataaac caagtgatat agegegattt 855 
Asp His Thr Val Arg 

210 



aaaaatgtct 


ttaaaactaa 


aggtaaccat 


gtagctagtt 


agtctctagg 


gtcctagagg 


915 


tctacgagtg 


tgcatgcatg 


gatttggtgc 


gttttttctt 


tttcatcttc 


attttgtttt 


975 


ttgaaacaag 


gaaccataaa 


cgaatatata 


tctaattctt 


gtttgatata 


tagtttggtc 


1035 


gaggcttcat 


gtcaagattt 


getcattegt 


agttagttga 


tctctagaga 


aattcaaaac 


1095 


acatggtgcc 


actaaaaaca 


caaaatgeaa 


atacttagct 


agagaactta 


atgatatgtt 


1155 


ttgtcttgat 


ttttgcaggg 


aacgeaacta 


aeggaagaga 


aegagegact 


tggcatgcaa 


1215 


atatgtaaca 


atgtgcatgc 


acacggtggt 


getgaategg 


agaaegctge 


tgtgtacgag 


1275 
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gaaggacagt 


cgtcggagtc 


tattactaac 


gccggaaact 


ctaccggagc 


gcctgttgac 


1335 


tccgagagct 


ccgacacttc 


ccttaggctc 


ggcttaccgt 


atggtggtta 


gagatggaac 


1395 


aattcaaaga 


agttgatgga 


gtgaggagag 


taatgtaaat 


ctttttaact 


cggtagtaac 


1455 


aagagacaat 


gtctaagtag 


tgaattctca 


aatgtttgtg 


taagtttctg 


cctatggaag 


1515 


aggctttcat 


ttttatgatt 


aaaaaaaaaa 


aaaaaaa 






1552 



<210> 26 

<211> 210 

<212> PRT 

<213> Arabidopsis thaliana 

<400> 26 

Met Ala Arg Glu Lys lie Gin lie Arg Lys lie Asp Asn Ala Thr Ala 
1*5 10 15 



Arg Gin Val Thr Phe Ser Lys Arg Arg Arg Gly Leu Phe Lys Lys Ala 
20 25 30 



Glu Glu Leu Ser Val Leu Cys Asp Ala Asp Val Ala Leu lie lie Phe 
35 40 45 



Ser Ser Thr Gly Lys Leu Phe Glu Phe Cys Ser Ser Ser Met Lys Glu 
50 55 60 



Val Leu Glu Arg His Asn Leu Gin Ser Lys Asn Leu Glu Lys Leu Asp 
65 70 75 80 



Gin Pro Ser Leu Glu Leu Gin Leu Val Glu Asn Ser Asp His Ala Arg 

85 90 95 



Met Ser Lys Glu lie Ala Asp Lys Ser His Arg Leu Arg Gin Met Arg 
100 105 110 



Gly Glu Glu Leu Gin Gly Leu Asp lie Glu Glu Leu Gin Gin Leu Glu 
115 120 125 



Lys Ala Leu Glu Thr Gly Leu Thr Arg Val lie Glu Thr Lys Ser Asp 

130 135 140 

Lys lie Met Ser Glu lie Ser Glu Leu Gin Lys Lys Gly Met Gin Leu 

145 150 155 160 



Met Asp Glu Asn Lys Arg Leu Arg Gin Gin Val Cys Val Leu Pro Ser 

165 170 175 



Leu Leu lie Thr Asn Pro Phe Leu Leu Ser Thr lie Asn Val His Thr 
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180 185 190 



Pro Lys Phe Asn Pro Gin Leu Ser Thr Thr His Met Phe Asp His Thr 
195 200 205 



Val Arg 
210 



<210> 


27 


<211> 


943 


<212> 


DNA 


<213> 


Arabidopsis 


<220> 




<221> 


CDS 


<222> 


(110) . . (700) 


<223> 


G1759 


<400> 


27 



cgagaaaagg aaaaaaaaaa atagaaagag aaaacgctta gtatctccgg cgacttgaac 60 

ccaaacctga ggatcaaatt agggcacaaa gccctctcgg agagaagcc atg gga aga 118 

Met Gly Arg 
1 

aaa aaa eta gaa ate aag cga att gag aac aaa agt age cga caa gtc 166 
Lys Lys Leu Glu lie Lys Arg lie Glu Asn Lys Ser Ser Arg Gin Val 
5 10 15 

acc ttc tec aaa cgt cgc aac ggt etc ate gag aaa get cgt cag ctt 214 
Thr Phe Ser Lys Arg Arg Asn Gly Leu lie Glu Lys Ala Arg Gin Leu 
20 25 30 35 

tct gtt etc tgt gac gca tec gtc get ctt etc gtc gtc tec gec tec 262 
Ser Val Leu Cys Asp Ala Ser Val Ala Leu Leu Val Val Ser Ala Ser 

40 45 50 

ggc aag etc tac age ttc tec tec ggc gat aac ctg gtc aag ate ctt 310 
Gly Lys Leu Tyr Ser Phe Ser Ser Gly Asp Asn Leu Val Lys lie Leu 
55 60 65 

gat cga tat ggg aaa cag cat get gat gat ctt aaa gee ttg gat cat 358 
Asp Arg Tyr Gly Lys Gin His Ala Asp Asp Leu Lys Ala Leu Asp His 
70 75 80 

cag tea aaa get ctg aac tat ggt tea cac tat gag eta ctt gaa ctt 406 
Gin Ser Lys Ala Leu Asn Tyr Gly Ser His Tyr Glu Leu Leu Glu Leu 
85 90 95 

gtg gat age aag ctt gtg gga tea aat gtc aaa aat gtg agt ate gat 454 
Val Asp Ser Lys Leu Val Gly Ser Asn Val Lys Asn Val Ser lie Asp 
100 105 110 115 

get ctt gtt caa ctg gag gaa cac ctt gag act gec etc tec gtg act 502 
Ala Leu Val Gin Leu Glu Glu His Leu Glu Thr Ala Leu Ser Val Thr 

120 125 130 

aga gec aag aag acc gaa etc atg ttg aag ctt gtt gag aat ctt aaa 55 0 

Arg Ala Lys Lys Thr Glu Leu Met Leu Lys Leu Val Glu Asn Leu Lys 
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135 140 145 

gaa aag gag aaa atg ctg aaa gaa gag aac cag gtt ttg get age cag 598 
Glu Lys Glu Lys Met Leu Lys Glu Glu Asn Gin Val Leu Ala Ser Gin 
150 155 160 

atg gag aat aat cat cat gtg gga gca gaa get gag atg gag atg tea 646 
Met Glu Asn Asn His His Val Gly Ala Glu Ala Glu Met Glu Met Ser 
165 170 175 

cct get gga caa ate tec gac aat ctt ccg gtg act etc cca eta ctt 694 
Pro Ala Gly Gin lie Ser Asp Asn Leu Pro Val Thr Leu Pro Leu Leu 
180 185 190 195 

aat tag ccaccttaaa tcggcggttg aaatcaaaat ccaaaacata tataattatg 750 
Asn , 

aagaaaaaaa aaataagata tgtaattatt ccgctgataa gggcgagcgt ttgtatatct 810 

taatactctc tetttggeca agagactttg tgtgtgatac ttaagtagac ggaactaagt 870 

caatactatc tgttttaaga caaaaggttg. atgaactttg taccttattc gtgtgagaaa 930 

aaaaaaaaaa aaa 943 

<210> 28 
<211> 196 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 28 

Met Gly Arg Lys Lys Leu Glu lie Lys Arg lie Glu Asn Lys Ser Ser 
15 10 15 

Arg Gin Val Thr Phe Ser Lys Arg Arg Asn Gly Leu lie Glu Lys Ala 
20 25 30 

Arg Gin Leu Ser Val Leu Cys Asp Ala Ser Val Ala Leu Leu Val Val 
35 40 45 

Ser Ala Ser Gly Lys Leu Tyr Ser Phe Ser Ser Gly Asp Asn Leu Val 
50 55 60 

Lys lie Leu Asp Arg Tyr Gly Lys Gin His Ala Asp Asp Leu Lys Ala 
65 70 75 80 

Leu Asp His Gin Ser Lys Ala Leu Asn Tyr Gly Ser His Tyr Glu Leu 

85 90 95 

Leu Glu Leu Val Asp Ser Lys Leu Val Gly Ser Asn Val Lys Asn Val 
100 105 110 

Ser He Asp Ala Leu Val Gin Leu Glu Glu His Leu Glu Thr Ala Leu 
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115 120 125 



Ser Val Thr Arg Ala Lys Lys Thr Glu Leu Met Leu Lys Leu Val Glu 
130 135 140 



Asn Leu Lys Glu Lys Glu Lys Met Leu Lys Glu Glu Asn Gin Val Leu 
145 150 155 160 

Ala Ser Gin Met Glu Asn Asn His His Val Gly Ala Glu Ala Glu Met 

165 170 175 

Glu Met Ser Pro- Ala Gly -Gin He Ser Asp Asn Leu Pro Val Thr Leu 
180 185 190 



Pro Leu Leu Asn 





195 


<210> 


29 


<211> 


1171 


<212> 


DNA 


<213> 


Arabidopsis 


<220> 




<221> 


CDS 


<222> 


(63) . . (959) 


<223> 


G192 


<400> 


29 



cttttttctc ttctctcctc agagattcga agctttttgt ctcccctgag taaccaaatt 60 

ca atg gcc gac gat tgg gat etc cac gec gta gtc aga ggc tgc tea 107 
Met Ala Asp Asp Trp Asp Leu. His Ala Val Val Arg Gly Cys Ser 
15 10 15 

gcc gta age tea tea get act acc acc gta tat tec ccc ggc gtt tea 155 

Ala Val Ser Ser Ser Ala Thr Thr Thr Val Tyr Ser Pro Gly Val Ser 

20 25 30 

tct cac aca aac cct ata ttc acc gtc gga cga caa agt aat gcc gtc 203 

Ser His Thr Asn Pro He Phe Thr Val Gly Arg Gin Ser Asn Ala Val 

35 40 45 

tec ttc gga gag att cga gat etc tac aca ccg ttc aca caa gaa tct 251 

Ser Phe Gly Glu He Arg Asp Leu Tyr Thr Pro Phe Thr Gin Glu Ser 
SO 55 60 

gtc gtc tct teg ttt tct tgt ata aac tac cca gaa gaa cct aga aag 2 99 

Val Val Ser Ser Phe Ser Cys He Asn Tyr Pro Glu Glu Pro Arg Lys 
65 70 75 

cca cag aac cag aaa cgt cct ctt tct etc tct get tct tec ggt age 347 

Pro Gin Asn Gin Lys Arg Pro Leu Ser Leu Ser Ala Ser Ser Gly Ser 
80 85 90 95 

gtc act age aaa ccc agt ggc tec aat acc tct aga tct aaa aga aga 3 95 

Val Thr Ser Lys Pro Ser Gly Ser Asn Thr Ser Arg Ser Lys Arg Arg 
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100 105 110 

aag ata cag cat aag aaa gtg tgc cat gta gca gca gaa get tta aac 443 
Lys lie Gin His Lys Lys Val Cys His Val Ala Ala Glu Ala Leu Asn 
115 120 125 

tec gat gtc tgg gca tgg cga aag tac gga cag aaa ccc ate aaa ggt 491 
Ser Asp Val Trp Ala Trp Arg Lys Tyr Gly Gin Lys Pro lie Lys Gly 
130 135 140 ' 

tea cca tat cca aga gga tac tac aga tgt agt aca tea aaa ggt tgt 539 
Ser Pro Tyr Pro Arg Gly Tyr Tyr Arg Cys Ser Thr Ser Lys Gly Cys 
145 150 155 

tta gec cgt aaa caa gtg gag cga aat aga tec gac ccg aag atg ttt 587 
Leu Ala Arg Lys Gin Val Glu Arg Asn Arg Ser Asp Pro Lys Met Phe 
160 165 170 175 

ate gtc act tac acg gcg gag cat aat cat cca get ccg aca cac cgt 635 
lie Val Thr Tyr Thr Ala Glu His Asn His Pro Ala Pro Thr His Arg 

180 185 190 

aat tct etc gee gga age aca cgt cag aaa cca tec gat caa cag acg 683 
Asn Ser Leu Ala Gly Ser Thr Arg Gin Lys Pro Ser Asp Gin Gin Thr 
195 200 205 

agt aaa tct ccg acg acc act att get act tat tea teg tct ccg gtg 731 
Ser Lys Ser Pro Thr Thr Thr lie Ala Thr Tyr Ser Ser Ser Pro Val 
210 215 220 

act tea gee gac gaa ttt gtt ttg cct gtt gag gat cat eta gcg gtg 779 
Thr Ser Ala Asp Glu Phe Val Leu Pro Val Glu Asp His Leu Ala Val 
225 230 235 

gga gat ctt gac gga gaa gaa gat ctg tta tct ttg teg gat acg gtg 827 
Gly Asp Leu Asp Gly Glu Glu Asp Leu Leu Ser Leu Ser Asp Thr Val 
240 245 250 255 

gtt age gat gat ttc ttc gat ggg tta gag gaa ttc gca gee gga gat 875 
Val Ser Asp Asp Phe Phe Asp Gly Leu Glu Glu Phe Ala Ala Gly Asp 

260 265 270 

age ttt tec ggg aac teg get ccg gcg agt ttt gat etc tct tgg gtt 923 
Ser Phe Ser Gly Asn Ser Ala Pro Ala Ser Phe Asp Leu Ser Trp Val 
275 280 285 

gtg aac agt gee gee act acc acc gga gga ata tga ttagattacg 969 
Val Asn Ser Ala Ala Thr Thr Thr Gly Gly lie 
290 295 

aeggcttaga atactcttat taggacagat ttataggatt aaggaattat teteggagea 1029 

tatgtaaaaa taggataaaa gaaaatgttc tttgttactt tttttcgggt tttcttccta 1089 

ttgtttctaa acatcttaga aaaaatttaa ttgtatattc cttaagctcg atacatcttg 1149 

ttttaaaaaa aaaaaaaaaa aa 1171 

<210> 30 
<211> 298 
<212> PRT 

<213> Arabidopsis thaliana 
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<400> 30 

Met Ala Asp Asp Trp Asp Leu His Ala Val Val Arg Gly Cys Ser Ala 
1 5 10 15 

Val Ser Ser Ser Ala Thr Thr Thr Val Tyr Ser Pro Gly Val Ser Ser 
20 25 30 

His Thr Asn Pro lie Phe Thr Val Gly Arg Gin Ser Asn Ala Val Ser 
35 40 45 

Phe Gly Glu lie. Arg Asp Leu Tyr Thr Pro Phe Thr Gin Glu Ser Val 
50 55 60 

Val Ser Ser Phe Ser Cys lie Asn Tyr Pro Glu Glu Pro Arg Lys Pro 
65 70 75 80 

Gin Asn Gin Lys Arg Pro Leu Ser Leu Ser Ala Ser Ser Gly Ser Val 

85 90 95 

Thr Ser Lys Pro Ser Gly Ser Asn Thr Ser Arg Ser Lys Arg Arg Lys 
100 105 110 

lie Gin His Lys Lys Val Cys His Val Ala Ala Glu Ala Leu Asn Ser 
115 120 125 

Asp Val Trp Ala Trp Arg Lys Tyr Gly Gin Lys Pro lie Lys Gly Ser 
130 135 140 

Pro Tyr Pro Arg Gly Tyr Tyr Arg Cys Ser Thr Ser Lys Gly Cys Leu 
145 150 155 160 

Ala Arg Lys Gin Val Glu Arg Asn Arg Ser Asp Pro Lys Met Phe lie 

165 170 175 

Val Thr Tyr Thr Ala Glu His Asn His Pro Ala Pro Thr His Arg Asn 
130 185 190 

Ser Leu Ala Gly Ser Thr Arg Gin Lys Pro Ser Asp Gin Gin Thr Ser 
195 200 205 

Lys Ser Pro Thr Thr Thr lie Ala Thr Tyr Ser Ser Ser Pro Val Thr 
210 215 220 

Ser Ala Asp Glu Phe Val Leu Pro Val Glu Asp His Leu Ala Val Gly 
225 230 235 240 
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Asp Leu Asp Gly Glu Glu Asp Leu Leu Ser Leu Ser Asp Thr Val Val 

245 250 255 



Ser Asp Asp Phe Phe Asp Gly Leu Glu Glu Phe Ala Ala Gly Asp Ser 
260 265 270 



Phe Ser Gly Asn Ser Ala Pro Ala Ser Phe Asp Leu Ser Trp Val Val 
275 280 285 



Asn Ser Ala Ala Thr Thr Thr Gly Gly He 
290 295 



<210> 


31 


<211> 


1139 


<212> 


DNA 


<213> 


Arabidopsis thaliana 


<220> 




<221> 


CDS 


<222> 


(106) . . (1032) 


<223> 


G234 


<400> 


31 



cacaacatca tacccaccaa catatataat cttgatcata gagagataaa cagaggccgc 60 

tatcaagaac aagactaaga acaagacttc actaggagta caagt atg gga aga gca 117 

Met Gly Arg Ala 
1 

ccg tgt tgt gac aaa gca aac gtg aag aaa ggg cct tgg tct cct gag 165 
Pro Cys Cys Asp Lys Ala Asn Val Lys Lys Gly Pro Trp Ser Pro Glu 
5 10 15 20 

gaa gat gca aaa etc aaa tct tac att gaa aat agt ggc acc gga ggc 213 
Glu Asp Ala Lys Leu Lys Ser Tyr He Glu Asn Ser Gly Thr Gly Gly 

25 30 35 

aat tgg ate get ttg cct caa aag att ggt tta aag aga tgt gga aag 261 
Asn Trp He Ala Leu Pro Gin Lys He Gly Leu Lys Arg Cys Gly Lys 
40 45 50 

agt tgc agg ctg agg tgg ctt aac tat ctt aga cca aac ate aaa cat 309 
Ser Cys Arg Leu Arg Trp Leu Asn Tyr Leu Arg Pro Asn He Lys His 
55 60 65. 

ggt ggc ttc tct gag gaa gaa gaa aac ate att tgt age ctt tac ctt 357 
Gly Gly Phe Ser Glu Glu Glu Glu Asn He He Cys Ser Leu Tyr Leu 
70 75 80 

aca att ggt age agg tgg tct ata ate get get caa ttg ccg gga cga 405 
Thr lie Gly Ser Arg Trp Ser He He Ala Ala Gin Leu Pro Gly Arg 
85 90 95 100 

aca gac aac gat ata aaa aac tat tgg aac acg agg etc aag aag aaa 4 53 

Thr Asp Asn Asp He Lys Asn Tyr Trp Asn Thr Arg Leu Lys Lys Lys 

105 110 115 

etc att aac aaa caa cgc aag gag ctt caa gaa get tgt atg gag cag 5 01 
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Leu lie Asn Lys Gin Arg Lys Glu Leu Gin Glu Ala Cys Met Glu Gin 
120 125 130 

caa gag atg atg gtg atg atg aag aga caa cac caa caa caa caa ate 54 9 

Gin Glu Met Met Val Met Met Lys Arg Gin His Gin Gin Gin Gin lie 
135 140 145 

caa act tct ttt atg atg aga caa gac caa aca atg ttc aca tgg cca 597 
Gin Thr Ser Phe Met Met Arg Gin Asp Gin Thr Met Phe Thr Trp Pro 
150 155 160 

eta cat cat cat aat gtt caa gtt cca get ctt ttc aga ate aaa cca 645 
Leu His His His Asn Val Gin Val Pro Ala Leu Phe Arg lie Lys Pro 
165 170 175 180 

act cgt ttt gcg acc aag aag atg tta age cag tgc tea tea aga aca 693 
Thr Arg Phe Ala Thr Lys Lys Met Leu Ser Gin Cys Ser Ser Arg Thr 

185 190 195 

tgg tea aga teg aag ate aag aac tgg aga aaa caa acc tea tea tea 741 
Trp Ser Arg Ser Lys lie Lys Asn Trp Arg Lys Gin Thr Ser Ser Ser 
200 205 210 

tea aga ttc aat gac aac get ttt gat cat etc tct ttc tct caa etc 789 
Ser Arg Phe Asn Asp Asn Ala Phe Asp His Leu Ser Phe Ser Gin Leu 
215 220 225 

ttg tta gat cct aat cat aac cac tta gga tea gga gag ggt ttc tec 837 
Leu Leu Asp Pro Asn His Asn His Leu Gly Ser Gly Glu Gly Phe Ser 
230 235 240 

atg aac tct ate ttg age gee aac aca aac tct cca ttg ctt aac aca 885 
Met Asn Ser lie Leu Ser Ala Asn Thr Asn Ser Pro Leu Leu Asn Thr 
245 250 255 260 

agt aat gat aat cag tgg ttc ggg aat ttc cag gec gaa acc gta aac 93 3 

Ser Asn Asp Asn Gin Trp Phe Gly Asn Phe Gin Ala Glu Thr Val Asn 

265 270 275 

ttg ttc tea gga gee tec aca agt act teg gca gat caa age act ata 981 
Leu Phe Ser Gly Ala Ser Thr Ser Thr Ser Ala Asp Gin Ser Thr lie 
280 285 290 

agt tgg gaa gac ata age tct ctt gtt tat tct gat tea aag caa ttt 102 9 

Ser Trp Glu Asp He Ser Ser Leu Val Tyr Ser Asp Ser Lys Gin Phe 
295 300 305 

ttt taattataat aatatattat tcttaagatg aaaegtacat cattattatt 1082 
Phe 



aattgggggt aegtaaegta tatatggaat aacgatctag tttgtttaaa tttaaaa 113 9 

<210> 32 
<211> 309 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 32 

Met Gly Arg Ala Pro Cys Cys Asp Lys Ala Asn Val Lys Lys Gly Pro 
15 10 15 
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Trp Ser Pro Glu Glu Asp Ala Lys Leu Lys Ser Tyr lie Glu Asn Ser 
20 25 30 



Gly Thr Gly Gly Asn Trp lie Ala Leu Pro Gin Lys lie Gly Leu Lys 
35 40 45 



Arg Cys Gly Lys Ser Cys Arg. Leu Arg Trp Leu Asn Tyr Leu Arg Pro 
50 55 60 



Asn lie Lys His Gly Gly Phe Ser Glu Glu Glu Glu Asn lie lie Cys 
65 70- 75 80 



Ser Leu Tyr Leu Thr lie Gly Ser Arg Trp Ser lie lie Ala Ala Gin 

85 90 95 



Leu Pro Gly Arg Thr Asp Asn Asp lie Lys Asn Tyr Trp Asn Thr Arg 
100 105 110 



Leu Lys Lys Lys Leu lie Asn Lys Gin Arg Lys Glu Leu Gin Glu Ala 
115 120 125 



Cys Met Glu Gin Gin Glu Met Met Val Met Met Lys Arg Gin His Gin 
130 135 140 



Gin Gin Gin lie Gin Thr Ser Phe Met Met Arg Gin Asp Gin Thr Met 
145 150 155 160 



Phe Thr Trp Pro Leu His His His Asn Val Gin Val Pro Ala Leu Phe 

165 170 175 



Arg lie Lys Pro Thr Arg Phe Ala Thr Lys Lys Met Leu Ser Gin Cys 
180 185 190 



Ser Ser Arg Thr Trp Ser Arg Ser Lys lie Lys Asn Trp Arg Lys Gin 
195 200 205 



Thr Ser Ser Ser Ser Arg Phe Asn Asp Asn Ala Phe Asp His Leu Ser 
210 215 220 



Phe Ser Gin Leu Leu Leu Asp Pro Asn His Asn His Leu Gly Ser Gly 
225 230 235 240 



Glu Gly Phe Ser Met Asn Ser lie Leu Ser Ala Asn Thr Asn Ser Pro 

245 250 255 



Leu Leu Asn Thr Ser Asn Asp Asn Gin Trp Phe Gly Asn Phe Gin Ala 
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260 265 270 



Glu Thr Val Asn Leu Phe Ser Gly Ala Ser Thr Ser Thr Ser Ala Asp 
275 280 285 



Gin Ser Thr He Ser Trp Glu Asp He Ser Ser Leu Val Tyr Ser Asp 
290 295 300 



Ser Lys Gin Phe Phe 



305 




<210> 


33 


<211> 


922 


<212> 


DNA 


<213> 


Arabidopsis 


<220> 




<221> 


CDS 


<222> 


(54) . . (647) 


<223> 


G361 


<400> 


33 



tctgtctctc tctctctctt tgtaaatata catatataga taagctcaca tat atg 56 

Met 
1 

gcg act gaa aca tct tct ttg aag etc ttc ggt ata aac eta ctt gaa 104 
Ala Thr Glu Thr Ser Ser Leu Lys Leu Phe Gly He Asn Leu Leu Glu 
5 10 15 

acg acg teg gtt caa aac cag tea teg gaa cca aga ccc gga tec gga 152 
Thr Thr Ser' Val Gin Asn Gin Ser Ser Glu Pro Arg Pro Gly Ser Gly 
20 25 30 

tea gga tec gag tea cgt aag tac gag tgt caa tac tgt tgt aga gag 20 0 

Ser Gly Ser Glu Ser Arg Lys Tyr Glu Cys Gin Tyr Cys Cys Arg Glu 
35 40 45 

ttt get aac tct caa get ctt ggt ggt cac caa aac get cac aag aaa 248 
Phe Ala Asn Ser Gin Ala Leu Gly Gly His Gin Asn Ala His Lys Lys 
50 55 60 65 

gag cgt- cag ctt ctt aaa cgt gca cag atg tta get act cgt ggt ttg 296 
Glu Arg Gin Leu Leu Lys Arg Ala Gin Met Leu Ala Thr Arg Gly Leu 

70 75 80 

cca cgt cat cat aat ttt cac cct cat ace aat ccg ctt etc tec gec 344 
Pro Arg His His Asn Phe His Pro His Thr Asn Pro Leu Leu Ser Ala 
85 90 95 

ttc gcg ccg ctg cct cac etc etc tct cag ccg cat cct ccg ccg cat 392 
Phe Ala Pro Leu Pro His Leu Leu Ser Gin Pro His Pro Pro Pro His 
100 105 110 

atg atg etc tct cct tct tct teg agt tct aag tgg ctt tac ggt gaa 440 
Met Met Leu Ser Pro Ser Ser Ser Ser Ser Lys Trp Leu Tyr Gly Glu 
115 120 125 
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cac atg teg tea caa aac gee gtt ggg tac ttt cat ggt gga agg gga 488 

His Met Ser Ser Gin Asn Ala Val Gly Tyr Phe His Gly Gly Arg Gly 
130 135 140 145 

ctt tac gga ggt ggc atg gag tct atg gee gga gaa gta aag act cat 536 
Leu Tyr Gly Gly Gly Met Glu Ser Met Ala Gly Glu Val Lys Thr'His 

150 155 160 

ggt ggt tct ttg ccg gag atg agg agg ttc gec gga gat agt gat egg 584 
Gly Gly Ser Leu Pro Glu Met Arg Arg Phe Ala Gly Asp Ser Asp Arg 
165 170 175 

agt age gga att aag tta gag aat ggt att ggg ctg gac etc cat tta 632 
Ser Ser Gly lie Lys Leu Glu Asn Gly He Gly Leu Asp Leu His Leu 
180 185 190 

age ctt ggg cca tga atgatt'ataa ttttggccca gtaaagatct gtaaaatact 687 
Ser Leu- Gly Pro 
195 

actaggattt catttttata gagtatgttt ttttccttaa tttcggttga aattggtgaa 747 

tatttttatc tcttacttac caaatctcat atttctatgt atgcgtttgc tttcactttt 807 

tttttttata taattcttct tgtaaaaaat gcaatgtgag ttttcttccc tatcattctg 867 

tcaagctttg gttcaattat ttagtaatcg aataatatag gaatagtgtt gaaag 922 

<210> 34 
<211> 197 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 34 

Met Ala Thr Glu Thr Ser Ser Leu Lys Leu Phe Gly He Asn Leu Leu 
15 10 15 

Glu Thr Thr Ser Val Gin Asn Gin Ser Ser Glu Pro Arg Pro Gly Ser 
20 25 30 

Gly Ser Gly Ser Glu Ser Arg Lys. Tyr Glu Cys Gin Tyr Cys Cys Arg 
35 40 45 

Glu Phe Ala Asn Ser Gin Ala Leu Gly Gly His Gin Asn Ala His Lys 
50 55 60 

Lys Glu Arg Gin Leu Leu Lys Arg Ala Gin Met Leu Ala Thr Arg Gly 
65 70 75 80 

Leu Pro Arg His His Asn Phe His Pro His Thr Asn Pro Leu Leu Ser 

85 90 95 

Ala Phe Ala Pro Leu Pro His Leu Leu Ser Gin Pro His Pro Pro Pro 
100 105 110 
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His Met Met Leu Ser Pro Ser Ser Ser Ser Ser Lys Trp Leu Tyr Gly 
115 120 125 



Glu His Met Ser Ser Gin Asn Ala Val Gly Tyr Phe His Gly Gly^Arg 
130 135 140 



Gly Leu Tyr Gly Gly Gly Met Glu Ser Met Ala Gly Glu Val Lys Thr 
145 150 155 160 



His Gly Gly Ser Leu Pro Glu Met Arg Arg 

165 170 



Arg Ser Ser Gly lie Lys Leu Glu Asn Gly lie Gly Leu Asp Leu His 
180 185 190 



Leu Ser Leu Gly Pro 
195 



Phe Ala Gly Asp Ser Asp 

175 



<210> 


35 


<211> 


420 


<212> 


DNA 


<213> 


Arabidopsis 


<220> 




<221> 


CDS 


<222> 


(1) . . (420) 


<223> 


G486 


<400> 


35 



atg aca gac gaa gat aga ttg ttg cca ata gcc aat gta ggg aga ctt 48 
Met Thr Asp Glu Asp Arg Leu Leu Pro He Ala Asn Val Gly Arg Leu 
1 5 10 15 

atg aag caa ate eta cca tea aat gca aag ate tea aaa gaa gca aaa 96 
Met Lys Gin He Leu Pro Ser Asn Ala Lys He Ser Lys Glu Ala Lys 
20 25 30 

caa aca gtt caa gaa tgt gca aca gag ttc ata age ttt gtt aca tgc 144 
Gin Thr Val Gin Glu Cys Ala Thr Glu Phe He Ser Phe Val Thr Cys 
35 40 45 

gaa gca tea gag aag tgc cac agg gag aat egg aag acg gtg aat gga 192 
Glu Ala Ser Glu Lys Cys His Arg Glu Asn Arg Lys Thr Val Asn Gly 
50 55 60 

gac gac ate tgg tgg get etc age act etc ggc etc gat aac tat get 240 
Asp Asp He Trp Trp Ala Leu Ser Thr Leu Gly Leu Asp Asn Tyr Ala 
65 70 75 80 

gac gcc gtg ggt agg cat ctt cac aag tac cgt gaa gcc gag aga gaa 288 
Asp Ala Val Gly Arg His Leu His Lys Tyr Arg Glu Ala Glu Arg Glu 

85 90 95 

aga act gag cac aac aaa ggt age aat gat agt ggg aat gag aaa gaa 33 6 

Arg Thr Glu His Asn Lys Gly Ser Asn Asp Ser Gly Asn Glu Lys Glu 
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100 105 110 

acc aac act aga agt gat gta cag aac caa teg aca aaa ttt att aga 384 
Thr Asn Thr Arg Ser Asp Val Gin Asn Gin Ser Thr Lys Phe lie Arg 
115 120 125 

gtt gtt gag aag gga age age tec teg gec cgt tga 420 
Val Val Glu Lys Gly Ser Ser Ser Ser Ala Arg 
130 135 



<210> 36 

<211> 139 

<212> PRT 

<213> Arabidopsis thaliana 

<400> 36 - 

Met Thr Asp Glu Asp Arg Leu Leu Pro lie Ala Asn Val Gly Arg Leu 
1 .5 10 15 



Met Lys Gin lie Leu Pro Ser Asn Ala Lys lie Ser Lys Glu Ala Lys 
20 25 30 



Gin Thr Val Gin Glu Cys Ala Thr Glu Phe He Ser Phe Val Thr Cys 
35 40 45 



Glu Ala Ser Glu Lys Cys His Arg Glu Asn Arg Lys Thr Val Asn Gly 
50 55 60 



Asp Asp He Trp Trp Ala Leu Ser Thr Leu Gly Leu Asp Asn Tyr Ala 
65 70 75 80 



Asp Ala Val Gly Arg His Leu His Lys Tyr Arg Glu Ala Glu Arg Glu 

85 90 95 



Arg Thr Glu His Asn Lys Gly Ser Asn Asp Ser Gly Asn Glu Lys Glu 
100 ' 105 110 



Thr Asn Thr Arg Ser Asp Val Gin Asn Gin Ser Thr Lys Phe He Arg 
115 120 125 



Val Val Glu Lys Gly Ser Ser Ser Ser Ala Arg 
130 135 



<210> 


37 


<211> 


1707 


<212> 


DNA 


<213> 


Arabidopsis thaliana 


<220> 




<221> 


CDS 


<222> 


(98) . . (1444) 


<223> 


G748 
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<400> 37 

ccacgcgtcc gcactctccc aaatctctct tctttaacaa caaaaaaaaa atcacagaga 60 

catagagaga agaagacgga acagaggctc caaaaaa atg atg atg gag act aga 115 

Met Met Met Glu Thr Arg 
1 5 

gat cca get att aag ctt ttc ggt atg aaa ate cct ttt ccg teg gtt 163 
Asp Pro Ala lie Lys Leu Phe Gly Met Lys lie Pro Phe Pro Ser Val 
10 15 20 

ttt gaa teg gca gtt acg gtg gag gat gac gaa gaa gat gac tgg age 211 
Phe Glu Ser Ala Val Thr Val Glu Asp Asp Glu Glu Asp Asp Trp Ser 

25 . " . 30 35 

ggc gga gat gac aaa tea cca gag aag gta act cca gag tta tea gat 25 9 

Gly Gly Asp Asp Lys Ser Pro Glu Lys Val Thr Pro Glu Leu Ser Asp 
40 ,45 50 

aag aac aac aac aac tgt aac gac aac agt ttt aac aat teg aaa ccc 3 07 

Lys Asn Asn Asn Asn Cys Asn Asp Asn Ser Phe Asn Asn Ser Lys Pro 
55 60 65 70 

gaa acc ttg gac aaa gag gaa gcg aca tea act gat cag ata gag agt 3 55 

Glu Thr Leu Asp Lys Glu Glu Ala Thr Ser Thr Asp Gin lie Glu Ser 

75 80 85 

agt gac acg cct gag gat aat cag cag acg aca cct gat ggt aaa acc 4 03 

Ser Asp Thr Pro Glu Asp Asn Gin Gin Thr Thr Pro Asp Gly Lys Thr 
90 95 100 

eta aag aaa ccg act aag att eta ccg tgt ccg aga tgc aaa age atg 451 
Leu Lys Lys Pro Thr Lys lie Leu Pro Cys Pro Arg Cys Lys Ser Met 
105 110 115 

gag acc aag ttc tgt tat tac aac aac tac aac ata aac cag cct cgt 4 99 

Glu Thr Lys Phe Cys Tyr Tyr Asn Asn Tyr Asn lie Asn Gin Pro Arg 
120 125 130 

cat ttc tgc aag get tgt cag aga tat tgg act get gga ggg act atg 547 
His Phe Cys Lys Ala Cys Gin Arg Tyr Trp Thr Ala Gly Gly Thr Met 
135 140 145 150 

agg aat gtt cct gtg ggg gca gga cgt cgt aag aac aaa age tea tct 595 
Arg Asn Val Pro Val Gly Ala Gly Arg Arg Lys Asn Lys Ser Ser Ser 

155 160 165 

tct cat tac cgt cac ate act att tec gag get ctt gag get gcg agg 64 3 

Ser His Tyr Arg His lie Thr lie Ser Glu Ala Leu Glu Ala Ala Arg 
170 175 180 

ctt gac ccg ggc tta cag gca aac aca agg gtc ttg agt ttt ggt etc 691 
Leu Asp Pro Gly Leu Gin Ala Asn Thr Arg Val Leu Ser Phe Gly Leu 
185 190 195 

gaa get cag cag cag cac gtt get get ccc atg aca cct gtt atg aag 73 9 

Glu Ala Gin Gin Gin His Val Ala Ala Pro Met Thr Pro Val Met Lys 
200 205 210 

eta caa gaa gat caa aag gtc tea aac ggt get agg aac agg ttt cac 787 
Leu Gin Glu Asp Gin Lys Val Ser Asn Gly Ala Arg Asn Arg Phe His 
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215 220 225 230 

ggg tta gcg gat caa egg ctt gta get egg gta gag aat gga gat gat 835 
Gly Leu Ala Asp Gin Arg Leu Val Ala Arg Val Glu Asn Gly Asp Asp 

235 240 245 

tgc tea age gga tec tct gtg ace ace tct aac aat cac tea gtg gat 883 
Cys Ser Ser Gly Ser .Ser Val Thr Thr Ser Asn Asn His Ser Val Asp 
250 255 260 

gaa tea aga gca caa age ggc agt gtt gtt gaa gca caa atg aac aac 931 
Glu Ser Arg Ala Gin Ser Gly Ser Val Val Glu Ala Gin Met Asn Asn 
265 270 275 

aac aac aac aat aac atg aat ggt tat get tgc ate cca ggt gtt cca 979 
Asn Asn Asn Asn- Asn Met- Asn Gly Tyr Ala Cys lie Pro Gly Val Pro 
280 285 290 

tgg cct tac acg tgg aat cca gcg atg cct cca cca ggt ttt tac ccg 1027 
Trp Pro Tyr Thr Trp Asn Pro Ala Met Pro Pro Pro Gly Phe Tyr Pro 
295 300 305 310 

cct cca ggg tat cca atg ccg ttt tac cct tac tgg acc ate cca atg 1075 
Pro Pro Gly Tyr Pro Met Pro Phe Tyr Pro Tyr Trp Thr lie Pro Met 

315 320 325 

eta cca ccg cat caa tec tea teg cct ata age caa aag tgt tea aat 1123 
Leu Pro Pro His Gin Ser Ser Ser Pro lie Ser Gin Lys Cys Ser Asn 
330 335 340 

aca aac tct ccg act etc gga aag cat ccg aga gat gaa gga tea teg 1171 
Thr Asn Ser Pro Thr Leu Gly Lys His Pro Arg Asp Glu Gly Ser Ser 
345 350 355 

aaa aag gac aat gag aca gag cga aaa cag aag gec ggg tgc gtt ctg 1219 
Lys Lys Asp Asn Glu Thr Glu Arg Lys Gin Lys Ala Gly Cys Val Leu 
360 365 370 

gtc ccg aaa acg ttg aga ata gat gat cct aac gaa gca gca aag age 1267 
Val Pro Lys Thr Leu Arg lie Asp Asp Pro Asn Glu Ala Ala Lys Ser 
375 380 385 390 

teg ata tgg aca aca ttg gga ate aag aac gag gcg atg tgc aaa gee 1315 
Ser lie Trp Thr Thr Leu Gly lie Lys Asn Glu Ala Met Cys Lys Ala 

395 400 405 

99t 99*. atg ttc aaa ggg ttt gat cat aag aca aag atg tat aac aac 1363 
Gly Gly Met Phe Lys Gly Phe Asp His Lys Thr Lys Met Tyr Asn Asn 
410 415 420 

gac aaa get gag aac tec cct gtt ctt tct get aac cct get get eta 1411 
Asp Lys Ala Glu Asn Ser Pro Val Leu Ser Ala Asn Pro Ala Ala Leu 
425 430 435 

tea aga tea cac aat ttc cat gaa cag att tag agttacatat gtatatgtat 14 64 
Ser Arg Ser His Asn .Phe His Glu Gin He 
440 445 

atatgtatga ttgattgtat gtatagatga tactggagaa tgatgagttt ttgagaatca 1524 

aactcttttc ttctttctag tgattgeett tattccttta catgttttgg ttctctgtac 1584 

actatttgat ttaccttttt tactttcttt cttcatttgt caggaaatgt tggaagataa 1644 
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cattaatggt aaaaagttgg tgtggaccgt tgttgcgttg gcatttcaaa aaaaaaaaaa 1704 
aaa 1707 

<210> 38 

<211> 448 

<212> PRT 

<213> Arabidopsis thaliana 

<400> 38 

Met Met Met Glu Thr Arg Asp Pro Ala lie Lys Leu Phe Gly Met Lys 
1 5-10 15 

lie Pro Phe Pro Ser Val Phe Glu Ser Ala Val Thr Val Glu Asp Asp 
20 25 30 

Glu Glu Asp Asp Trp Ser Gly Gly Asp Asp Lys Ser Pro Glu Lys Val 
35 40 45 

Thr Pro Glu Leu Ser Asp Lys Asn Asn Asn Asn. Cys Asn Asp Asn Ser 
50 55 60 

Phe Asn Asn Ser Lys Pro Glu Thr Leu Asp Lys Glu Glu Ala Thr Ser 
65 70 75 80 

Thr Asp Gin lie Glu Ser Ser Asp Thr Pro Glu Asp Asn Gin Gin Thr 

85 90 95 

Thr Pro Asp Gly Lys Thr Leu Lys Lys Pro Thr Lys lie Leu Pro Cys 
100 105 110 

Pro Arg Cys Lys Ser Met Glu Thr Lys Phe Cys Tyr Tyr Asn Asn Tyr 
115 120 125 

Asn lie Ash Gin Pro Arg His Phe Cys Lys Ala Cys Gin Arg Tyr Trp 
130 135 140 

Thr Ala Gly Gly Thr Met Arg Asn Val Pro Val Gly Ala Gly Arg Arg 
145 150 155 160 

Lys Asn Lys Ser Ser Ser Ser His Tyr Arg His lie Thr lie Ser Glu 

165 170 175 

Ala Leu Glu Ala Ala Arg Leu Asp Pro Gly Leu Gin Ala Asn Thr Arg 
180 185 190 



Val Leu Ser 
195 



Phe Gly Leu Glu Ala Gin Gin Gin His Val Ala Ala Pro 

200 205 
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Met Thr Pro Val Met Lys Leu Gin Glu Asp Gin Lys Val Ser Asn Gly 
210 215 220 



Ala Arg Asn Arg Phe His Gly Leu Ala Asp Gin Arg Leu Val Ala Arg 
225 230 235 240 



Val Glu Asn Gly Asp Asp Cys Ser Ser Gly Ser Ser Val Thr Thr Ser 

245 250 255 



Asn Asn His Ser Val Asp Glu Ser Arg Ala Gin Ser Gly Ser Val Val 
260. - , ' 265 270 



Glu Ala Gin Met Asn Asn Asn Asn Asn Asn Asn Met Asn Gly Tyr Ala 
275 ' 280 285 



Cys lie Pro Gly Val Pro Trp Pro Tyr Thr Trp Asn Pro Ala Met Pro 
290 295 300 



Pro Pro Gly Phe Tyr Pro Pro Pro Gly Tyr Pro Met Pro Phe Tyr Pro 
305 310 315 320 



Tyr Trp Thr lie Pro Met Leu Pro Pro His Gin Ser Ser Ser Pro lie 

325 330 335 



Ser Gin Lys Cys Ser Asn Thr Asn Ser Pro Thr Leu Gly Lys His Pro 
340 345 350 



Arg Asp Glu Gly Ser Ser Lys Lys Asp Asn Glu Thr Glu Arg Lys Gin 
355 360 365 



Lys Ala Gly Cys Val Leu Val Pro Lys Thr Leu Arg lie Asp Asp Pro 
370 375 380 



Asn Glu Ala Ala Lys Ser Ser lie Trp Thr Thr Leu Gly lie Lys Asn 
385 390 395 400 



Glu Ala Met Cys Lys Ala Gly Gly Met Phe Lys Gly Phe Asp His Lys 

405 410 415 



Thr Lys Met Tyr Asn Asn Asp Lys Ala Glu Asn Ser Pro Val Leu Ser 
420 425 430 



Ala Asn Pro Ala Ala Leu Ser Arg Ser His Asn Phe His Glu Gin lie 
435 440 445 



<210> 39 
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<211> 1095 

<212> DNA 

<213> Arabidopsis thaliana 
<220> 

<221> CDS 

<222> (180) . . (917) 

<223> G994 

<400> 39 

tgtatatata gttagttagt tgagataaac ttggttacca cttttgtgtg gtctttcttt 60 

ttctttttct ccattttcca tttatcgacc ccttgggtgt agctaattac tttcgcgatt 120 

ttcaaatcca ataaagtttt aa.tttgatga agcttttttt aaaccatata atataaata 179 

atg ggt ggt cgt aaa cca tgt tgt gat gag gtt gga tta aga aag ggt 227 
Met Gly Gly Arg Lys Pro Cys Cys Asp Glu Val Gly Leu Arg Lys Gly 
15 10 15 

cca tgg aca gtg gaa gaa gat ggg aaa eta gtt gat ttc tta agg gca 275 
Pro Trp Thr Val Glu Glu Asp Gly Lys Leu Val Asp Phe Leu Arg Ala 
20 25 30 

cgt ggc aac tgc ggt ggt ggt gga gga gga tgg tgc tgg aga gac gtg 323 
Arg Gly Asn Cys Gly Gly Gly Gly Gly Gly Trp Cys Trp Arg Asp Val 
35 40 45 

cca aaa ctg gcg ggg eta agg agg tgt ggc aaa agt tgc cgt etc egg 3 71 

Pro Lys Leu Ala Gly Leu Arg Arg Cys Gly Lys Ser Cys Arg Leu Arg 
50 55 60 

tgg act aat tat etc egg cca gat etc aag aga ggt ctt ttt act gaa 419 
Trp Thr Asn Tyr Leu Arg Pro Asp Leu Lys Arg Gly Leu Phe Thr Glu 
65 70 75 80 

gaa gaa ate caa eta gtc att gat ctt cat get cgc ctt ggc aat aga 467 
Glu Glu lie Gin Leu Val lie Asp Leu His Ala Arg Leu Gly Asn Arg 

85 90 95 

tgg teg aag att gca gtg gag tta cca gga aga aca gac aac gat ate 515 
Trp Ser Lys lie Ala Val Glu Leu Pro Gly Arg Thr Asp Asn Asp lie 
100 105 110 

aaa aat tat tgg aac act cat ata aag agg aag ctt ata aga atg ggt 563 
Lys Asn Tyr Trp Asn Thr His lie Lys Arg Lys Leu lie Arg Met Gly 
115 120 125 

att gat cca aac aca cat cgt cga ttt gac caa caa aaa gtc aac gag 611 
lie Asp Pro Asn Thr His Arg Arg Phe Asp Gin Gin Lys Val Asn Glu 
130 135 140 

gag gaa acg ata ttg gtc aac gat cca aag cct ctg tct gag acc gag 659 
Glu Glu Thr lie Leu Val Asn Asp Pro Lys Pro Leu Ser Glu Thr Glu 
145 150 155 160 

gta tct gtt get ttg aag aat gac acg tea gca gtg tta tea gga aat 707 
Val Ser Val Ala Leu. Lys Asn Asp Thr Ser Ala Val Leu Ser Gly Asn 

165 170 175 

eta aac caa ttg get gac gtg gac ggt gat gat cag ccg tgg age ttt 755 
Leu Asn Gin Leu Ala Asp Val Asp Gly Asp Asp Gin Pro Trp Ser Phe 
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180 185 190 

eta atg gaa aat gac gaa gga gga ggt ggc gac gec gec gga gag ctt 803 
Leu Met Glu Asn Asp Glu Gly Gly Gly Gly Asp Ala Ala Gly Glu Leu 
195 200 205 

acg atg eta ttg tec ggt gac att acg tea tea tgt tct tct teg tea 851 
Thr Met Leu Leu Ser Gly Asp lie Thr Ser Ser Cys Ser Ser Ser Ser 
210 215 220 

tct ttg tgg atg aag tat gga gaa ttc gga tac gaa gat tta gaa ctt 899 
Ser Leu Trp Met Lys Tyr Gly Glu Phe Gly Tyr Glu Asp Leu Glu Leu 
225 230 235 240 

gga tgt ttc gat gtt tag agattcaagt atgtttaatt aggcegtagg 947. 
Gly Cys Phe Asp Val 

245 

ttgattaatc ataaggttca ttgacttcat tctagaattg tgtagttgga ccagtataaa 1007 

gaatcaaagt tatgaaacat tgtaatttga tttccaaatt aatctaatga ataaatgtgc 1067 

tttgcaaaaa aaaaaaaaaa aaaaaaaa 10 95 

<210> 40 
<211> 245 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 40 

Met Gly Gly Arg Lys Pro Cys Cys Asp Glu Val Gly Leu Arg Lys Gly 
15 10 15 

Pro Trp Thr Val Glu Glu Asp Gly Lys Leu Val Asp Phe Leu Arg Ala 
20 25 30 

Arg Gly Asn Cys Gly Gly Gly Gly Gly Gly Trp Cys Trp Arg Asp Val 
35 40 45 

Pro Lys Leu Ala Gly Leu Arg Arg Cys Gly Lys Ser Cys Arg Leu Arg 
50 55 60 

Trp Thr Asn Tyr Leu Arg Pro Asp Leu Lys Arg Gly Leu Phe Thr Glu 
65 70 75 80 

Glu Glu lie Gin Leu Val lie Asp Leu His Ala Arg Leu Gly Asn Arg 

85 90 95 

Trp Ser Lys lie Ala Val Glu Leu Pro Gly Arg Thr Asp Asn Asp lie 
100 105 110 

Lys Asn Tyr Trp Asn Thr His lie Lys Arg Lys Leu lie Arg Met Gly 
115 120 125 
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lie Asp Pro Asn Thr His Arg Arg Phe Asp Gin Gin Lys Val Asn Glu 
130 135 140 



Glu Glu Thr lie Leu Val Asn Asp Pro Lya Pro Leu Ser Glu Thr'Glu 
145 150 155 160 

Val Ser Val Ala Leu Lys Asn Asp Thr Ser Ala Val Leu Ser Gly Asn 

165 170 175 



Leu Asn Gin Leu Ala Asp Val Asp Gly Asp Asp Gin Pro Trp Ser Phe 

180 185 190 

Leu Met Glu Asn Asp Glu Gly Gly Gly Gly Asp Ala Ala Gly Glu Leu 
195 200 205 



Thr Met Leu Leu Ser Gly Asp lie Thr Ser Ser Cys Ser Ser Ser Ser 
210 215 220 



Ser Leu Trp Met Lys Tyr Gly Glu Phe Gly Tyr Glu Asp Leu Glu Leu 
225 230 235 240 



Gly Cys Phe Asp Val 

245 



<210> 


41 


<211> 


965 


<212> 


DNA 


<213> 


Arabididopsis 


<220> 




<221> 


CDS 


<222> 


(56) . . (667) 


<223> 


G1335 


<400> 


41 



ttttttttta aaagatttag agagaaaagt gagttattaa gagattccaa tcaaa atg 5 8 

Met 
1 

age gga gac aac ggc ggt ggt gag agg cgc aaa ggc tec gtc aag tgg 106 
Ser Gly Asp Asn Gly Gly Gly Glu Arg Arg Lys Gly Ser Val Lys Trp 
5 10 15 

ttt gat acc cag aag ggt ttc ggc ttc ate act cct gac gac ggt ggc 154 
Phe Asp Thr Gin Lys Gly Phe Gly Phe lie Thr Pro' Asp Asp Gly Gly 
20 25 30 

gac gat etc ttc gtt cac cag tec tec ate aga tct gag ggt ttc cgt 202 
Asp Asp Leu Phe Val His Gin Ser Ser lie Arg Ser Glu Gly Phe Arg 
35 40 45 

age etc get gec gaa gaa gec gta gag ttc gag gtt gag ate gac aac 250 
Ser Leu Ala Ala Glu Glu Ala Val Glu Phe Glu Val Glu lie Asp Asn 
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50 55 60 65 

aac aac cgt ccc aag gcc ate gat gtt tct gga ccc gac ggc get ccc 298 
Asn Asn Arg Pro Lys Ala lie Asp Val Ser Gly Pro Asp Gly Ala Pro 

70 75 80 

gtc caa gga aac age ggt ggt ggt tea tct ggc gga cgc ggc ggt ttc 346 
Val Gin Gly Asn Ser Gly Gly Gly Ser Ser Gly Gly Arg Gly Gly Phe 
85 90 95 

ggt gga gga aga gga ggt gga cgc gga tct gga ggt gga tac ggc ggt 394 
Gly Gly Gly Arg Gly Gly Gly Arg Gly Ser Gly Gly Gly Tyr Gly Gly 
100 105 110 

ggc ggt ggt gga tac gga gga aga gga ggt ggt ggt cga gga ggc age 442 
Gly Gly Gly Gly Tyr Gly Gly Arg Gly Gly Gly Gly Arg Gly Gly Ser 
115 - " 120 125 

gac tgc tac aag tgt ggt gag ccc ggt cac atg gcg aga. gac tgt tct 4 90 

Asp Cys Tyr Lys Cys Gly Glu Pro Gly His Met Ala Arg' Asp Cys Ser 
130 135 140 145 

gaa ggc ggt gga ggt tac gga gga ggc ggc ggt ggc tac gga ggt gga 538 
Glu Gly Gly Gly Gly Tyr Gly Gly Gly Gly Gly Gly Tyr Gly Gly Gly 

150 155 160 

ggc gga tac ggc gga gga ggt ggt ggt tac gga ggt ggt ggc cgt gga 586 
Gly Gly Tyr Gly Gly Gly Gly Gly Gly Tyr Gly Gly Gly Gly Arg Gly 
165 170 175 

ggt ggt ggc ggc ggg gga age tgc tac age tgt ggc gag teg gga cat 634 
Gly Gly Gly Gly Gly Gly Ser Cys Tyr Ser Cys Gly Glu Ser Gly His 
180 185 190 

ttc gcc agg gat tgc ace age ggt gga cgt taa aaccaacgcc ggttacgcgg 687 
Phe Ala Arg Asp Cys Thr Ser Gly Gly Arg 
195 200 

tggagaagag tgagttggtt atctcacaag tgatcggttc tttctcccgc cgccttctat 74 7 

ctctctatta tccacttttt gcttattatg atggatctct atctttgtta gttggttttt 807 

tcttgatggt tteggattag gactcttctt ttggttttgc tacttatggt tggttttatt 867 

tatggtactt gtgatatggg tgaaatgetc tacttgttgc tctgtttcaa gtgttcataa 927 

tatgegaaca aatattctgg gttttgtttc aaaaaaaa 965 

<210> 42 
<211> 203 
<212> PRT 

<213> Arabididopsis thaliana 
<400> 42 

Met Ser Gly Asp Asn Gly Gly Gly Glu Arg Arg Lys Gly Ser Val Lys 
15 10 15 

Trp Phe Asp Thr Gin Lys Gly Phe Gly Phe lie Thr Pro Asp Asp Gly 
20 25 30 
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Gly Asp Asp Leu Phe Val His Gin Ser Ser lie Arg Ser Glu Gly Phe 
35 40 45 

Arg Ser Leu Ala Ala Glu Glu Ala Val Glu Phe Glu Val Glu He Asp 
50 55 60 

Asn Asn Asn Arg Pro Lys Ala He Asp Val Ser Gly Pro Asp Gly Ala 
65 70 75 80 

Pro Val Gin Gly Asn Ser Gly Gly Gly Ser Ser Gly Gly Arg Gly Gly 

85 90 95 

Phe Gly Gly Gly Arg Gly Gly Gly Arg Gly Ser Gly Gly Gly Tyr Gly 
100 105 110 

Gly Gly Gly Gly Gly Tyr Gly Gly Arg Gly Gly Gly Gly Arg Gly Gly 
115 120 125 

Ser Asp Cys Tyr Lys Cys Gly Glu Pro Gly His Met Ala Arg Asp Cys 
130 135 140 

Ser Glu Gly Gly Gly Gly Tyr Gly Gly Gly Gly Gly Gly Tyr Gly Gly 
145 150 155 160 

Gly Gly Gly Tyr Gly Gly Gly Gly Gly Gly Tyr Gly Gly Gly Gly Arg 

165 170 175 

Gly Gly Gly Gly Gly Gly Gly Ser Cys Tyr Ser Cys Gly Glu Ser Gly 
180 185 190 

His Phe Ala Arg Asp Cys Thr Ser Gly Gly Arg 
195 200 

<210> 43 

<211> 1554 

<212> DNA 

<213> Arabidopsis thaliana 
<220> 

<221> CDS 

<222> (137) . . (1285) 

<223> G562 

<400> 43 

atttgaattt ctgggtttct ctctgtttaa gcttcttctt cttcatcttc tgcttacgtt 60 

tcttcttcaa ggagctttcg gattcttgta gaaagagtca ttgttctctt gagtgggaaa 120 

ccttgaaacc attcct atg gga aat age age gag gaa cca aag cct cct acc 172 

Met Gly Asn Ser Ser Glu Glu Pro Lys Pro Pro Thr 
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15 10 

aaa tea gat aaa cca tct tea ccc ccg gtg gat caa aca aat gtt cat 220 
Lys Ser Asp Lys Pro Ser Ser Pro Pro Val Asp Gin Thr Asn Val His 
15 20 25 

gtc tac cct gat tgg gca get atg cag gca tat tat ggt cca aga gta 268 
Val Tyr Pro Asp Trp Ala Ala Met Gin Ala Tyr Tyr Gly Pro Arg Val 
30 35 40 

gca atg cct cct tat tac aat tea get atg get gca tct ggt cat cct 316 
Ala Met Pro Pro Tyr Tyr Asn Ser Ala Met Ala Ala Ser Gly His Pro 
45 50 55 60 

cct cct cct tac atg tgg aat cct cag cat atg atg tea cca tct gga 364 
Pro Pro Pro Tyr Met Trp Asn Pro Gin His Met Met Ser Pro Ser Gly 

65 70 75 

gca ccc tat get get gtt tat cct cat gga gga gga gtt tac get cat 412 
Ala Pro Tyr Ala Ala Val Tyr Pro His Gly Gly Gly Val Tyr Ala His 
80 85 90 

ccc ggt att ccc atg gga tea ctg cct caa ggt caa aag gat cca cct 4 60 

Pro Gly lie Pro Met Gly Ser Leu Pro Gin Gly Gin Lys Asp Pro Pro 
95 100 105 

tta aca act ccg ggg acg ctt ttg age ate gac act cct act aaa tct 508 
Leu Thr Thr Pro Gly Thr Leu Leu Ser lie Asp Thr Pro Thr Lys Ser 
110 115 120 

aca ggg aac aca gac aat gga ttg atg aag aag ctg aaa gag ttt gat 556 
Thr Gly Asn Thr Asp Asn Gly Leu Met Lys Lys Leu Lys Glu Phe Asp 
125 130 135 140 

ggg ctt get atg tct eta gga aat ggg aat cct gaa aat ggt gca gat 604 
Gly Leu Ala Met Ser Leu Gly Asn Gly Asn Pro Glu Asn Gly Ala Asp 

145 150 155 

gaa cat aaa cga tea egg aac age tea gaa act gat ggt tct act gat 652 
Glu His Lys Arg Ser Arg Asn Ser Ser Glu Thr Asp Gly Ser Thr Asp 
160 165 170 

gga agt gat ggg aat aca act ggg gca gat gaa ccg aaa ctt aaa aga 700 
Gly Ser Asp Gly Asn Thr Thr Gly Ala Asp Glu Pro Lys Leu Lys Arg 
175 180 185 

agt cga gag gga act cca aca aaa gat ggg aaa caa ttg gtt caa get 74 8 

Ser Arg Glu Gly Thr Pro Thr Lys Asp Gly Lys Gin Leu Val Gin Ala 
190 195 200 

age tea ttt cat tct gtt tct ccg tea agt ggt gat acc ggc gta aaa 796 
Ser Ser Phe His Ser Val Ser Pro Ser Ser Gly Asp Thr Gly Val Lys 
205 210 215 220 

etc att caa gga tct gga get ata etc tct cct ggt gta agt gca aat 844 
Leu He Gin Gly Ser Gly Ala lie Leu Ser Pro Gly Val Ser Ala Asn 

225 230 235 

tec aac ccc ttc atg tea caa tct tta gec atg gtt cct cct gaa act 892 
Ser Asn Pro Phe Met Ser Gin Ser Leu Ala Met Val Pro Pro Glu Thr 
240 245 250 

tgg ctt cag aac gag aga gaa ctg aaa egg gag cga agg aaa cag tct 940 
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Trp Leu Gin Asn Glu Arg Glu Leu Lys Arg Glu Arg Arg Lys Gin Ser 
255 260 265 

aat aga gaa tct get aga agg tea aga tta agg aaa cag gee gag aca 988 
Asn Arg Glu Ser Ala Arg Arg Ser Arg Leu Arg Lys Gin Ala Glu Thr 
270 275 280 

gaa gaa ctt get agg aaa gtg gaa gee ttg aca gee gaa aac atg gca 1036 
Glu Glu Leu Ala Arg Lys Val Glu Ala Leu Thr Ala Glu Asn Met Ala 
285 290 . 295 300 

tta aga tct gaa eta aac caa ctt aat gag aaa tct gat aaa eta aga 1084 
Leu Arg Ser Glu Leu Asn Gin Leu Asn Glu Lys Ser Asp Lys Leu Arg 

305 310 315 

gga gca aat gca ace ttg .ttg. gac aaa ctg aaa tgc teg gaa ccc gaa 1132 
Gly Ala Asn Ala Thr Leu Leu Asp Lys Leu Lys Cys Ser Glu Pro Glu 
320 325 330 

aag aga gtc ccc gca aat atg ttg tct aga gtt aag aac tea gga get 118 0 

Lys Arg Val Pro Ala Asn Met Leu Ser Arg Val Lys Asn Ser Gly Ala 
335 340 345 

gga gat aag aac aag aac caa gga gac aat gat tct aac tct aca age 122 8 

Gly Asp Lys Asn Lys Asn Gin Gly Asp Asn Asp Ser Asn Ser Thr Ser 
350 355 360 

aaa ttc cat caa ctg etc gat acg aag cct cga get aaa gca gta get 127 6 

Lys Phe His Gin Leu Leu Asp Thr Lys Pro Arg Ala Lys Ala Val Ala 
365 370 375 380 

gca ggc tga atcgatggta attcatgtcg atttctactt aatttgtcga 13 25 

Ala Gly 



cataaacaaa 


gaaaataagt 


gctactaatt 


tcagaaaaac 


ttgatagata 


gatagtatag 


1385 


tagagagaga 


gagagagaga 


gaggtgtgat 


gattattgat 


ctataaattt 


teggagagag 


1445 


agagggagaa 


agagaaactt 


ttcctccaga 


tgaaaatttg gtgttatggt 


ttgttactgt 


1505 


taatatagag 


aggcttttct 


ttttttataa 


aatggcttcc 


tttgttgca 




1554 



<210> 44 
<211> 382 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 44 

Met Gly Asn Ser Ser Glu Glu Pro Lys Pro Pro Thr Lys Ser Asp Lys 
15 10 15 



Pro Ser Ser Pro Pro Val Asp Gin Thr Asn Val His Val Tyr Pro Asp 
20 25 30 



Trp Ala Ala Met Gin Ala Tyr Tyr Gly Pro Arg Val Ala Met Pro Pro 
35 40 45 
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Tyr Tyr Asn Ser Ala Met Ala Ala Ser Gly His Pro Pro Pro Pro Tyr 
50 55 60 



Met Trp Asn Pro Gin His Met Met Ser Pro Ser Gly Ala Pro Tyr Ala 
65 70 75 80 



Ala Val Tyr Pro His Gly Gly Gly Val Tyr Ala His Pro Gly He Pro 

85 90 95 

Met Gly Ser Leu Pro Gin Gly Gin Lys Asp Pro Pro Leu Thr Thr Pro 
100 105 110 

Gly Thr Leu Leu Ser He Asp' Thr Pro Thr Lys Ser Thr Gly Asn Thr 
115 120 125 

Asp Asn Gly Leu Met Lys Lys Leu Lys Glu Phe Asp Gly Leu Ala Met 
130 135 140 

Ser Leu Gly Asn Gly Asn Pro Glu Asn Gly Ala Asp Glu His Lys Arg 
145 150 155 160 

Ser Arg Asn Ser Ser Glu Thr Asp Gly Ser Thr Asp Gly Ser Asp Gly 

165 170 175 

Asn Thr Thr Gly Ala Asp Glu Pro Lys Leu Lys Arg Ser Arg Glu Gly 
180 185 190 

Thr Pro Thr Lys Asp Gly Lys Gin Leu Val Gin Ala Ser Ser Phe His 
195 200 205 

Ser Val Ser Pro Ser Ser Gly Asp Thr Gly Val Lys Leu He Gin Gly 
210 215 220 

Ser Gly Ala He Leu Ser Pro Gly Val Ser Ala Asn Ser Asn Pro Phe 
225 • 230 235 240 

Met Ser Gin Ser Leu Ala Met Val Pro Pro Glu Thr Trp Leu Gin Asn 

245 250 255 

Glu Arg Glu Leu Lys Arg Glu Arg Arg Lys Gin Ser Asn Arg Glu Ser 
260 265 270 

Ala Arg Arg Ser Arg Leu Arg Lys Gin Ala Glu Thr Glu Glu Leu Ala 
275 280 285 

Arg Lys Val Glu Ala Leu Thr Ala Glu Asn Met Ala Leu Arg Ser Glu 
290 295 300 
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Leu Asn Gin Leu Asn Glu Lys Ser Asp Lys Leu Arg Gly Ala Asn Ala 

305 310 315 320 

Thr Leu Leu Asp Lys Leu Lys Cys Ser Glu Pro Glu Lys Arg Val Pro 

325 330 335 

Ala Asn Met Leu Ser Arg Val Lys Asn Ser Gly Ala Gly Asp Lys Asn 

340 345 350 



Lys Asn Gin Gly Asp Asn Asp Ser Asn Ser Thr Ser Lys Phe His Gin 
355 360 365 



Leu Leu Asp Thr Lys Pro Arg Ala Lys Ala Val Ala Ala Gly 
370 375 380 



<210> 


45 


<211> 


513 


<212> 


DNA 


<213> 


Arabidopsis 


<220> 




<221> 


CDS 


<222> 


(1) . . (513) 


<223> 


G736 


<400> 


45 



atggcg act caa gat tct caa ggg att aaa etc ttt ggc aaa act att 48 
Met Ala Thr Gin Asp Ser Gin Gly lie Lys Leu Phe Gly Lys Thr lie 
15 10 15 

gca ttt aac act cga aca ata aaa aat gaa gaa gag aca cac ccg ccg 96 
Ala Phe Asn Thr Arg Thr lie Lys Asn Glu Glu Glu Thr His Pro Pro 
20 25 30 

gag caa gaa gec aca ata gec gtt aga tea tea tea tea teg gat ctg 144 
Glu Gin Glu Ala Thr lie Ala Val Arg Ser Ser Ser Ser Ser Asp Leu 
35 40 45 

acg gec gag aag cgt ccg gat aag ate ata gca tgt cca aga tgc aag 192 
Thr Ala Glu Lys Arg Pro Asp Lys lie lie Ala Cys Pro Arg Cys Lys 
50 55 60 

age atg gag aca aag ttc tgt tac ttc aac aac tac aac ggt aat cag 240 
Ser Met Glu Thr Lys Phe Cys Tyr Phe Asn Asn Tyr Asn Gly Asn Gin 
65 70 75 80 

cct cga cac ttt tgt aaa ggc tgc cac cgt tac tgg acc gec ggt ggt 288 
Pro Arg His Phe Cys Lys Gly Cys His Arg Tyr Trp Thr Ala Gly Gly 

85 90 95 

gca etc egg aac gtt ccc gtc ggc gec ggt cgt egg aag tec aaa cca 336 
Ala Leu Arg Asn Val Pro Val Gly Ala Gly Arg Arg Lys Ser Lys Pro 
100 105 110 

cct ggt cgt gtc gtg gtt ggt atg ctt gga gat gga aat ggt gtt cgc 3 84 

Pro Gly Arg Val Val Val Gly Met Leu Gly Asp Gly Asn Gly Val Arg 
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caa gtc gag ctt ata aat ggc ttg etc gtt gag gag tgg cag cat gec 432 
Gin Val Glu Leu lie Asn Gly Leu Leu Val Glu Glu Trp Gin His Ala 
130 135 140 

gca gec gca get cac ggt agt ttc egg cat gat ttt ccc atg aag egg 480 
Ala Ala Ala Ala His Gly Ser Phe Arg His Asp Phe Pro Met Lys Arg 
145 150 155 160 

etc egg tgt tac tec gac ggt caa teg tgc tga 513 
Leu Arg Cys Tyr Ser Asp Gly Gin Ser Cys 

165 170 

<210> 46 
<211> 170 ■ 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 46 

Met Ala Thr Gin Asp Ser Gin Gly lie Lys Leu Phe Gly Lys Thr lie 
1 5 10 15 

Ala Phe Asn Thr Arg Thr lie Lys Asn Glu Glu Glu Thr His Pro Pro 
20 25 30 

Glu Gin Glu Ala Thr lie Ala Val Arg Ser Ser Ser Ser Ser Asp Leu 
35 40 45 

Thr Ala Glu Lys Arg Pro Asp Lys lie lie Ala Cys Pro Arg Cys Lys 
50 55 60 

Ser Met Glu Thr Lys Phe Cys Tyr Phe Asn Asn Tyr Asn Gly Asn Gin 
65 70 75 80 

Pro Arg His Phe Cys Lys Gly Cys His Arg Tyr Trp Thr Ala Gly Gly 

85 90 95 

Ala Leu Arg Asn Val Pro Val Gly Ala Gly Arg Arg Lys Ser Lys Pro 
100 105 110 

Pro Gly Arg Val Val Val Gly Met Leu Gly Asp Gly Asn Gly Val Arg 
115 120 125 

Gin Val Glu Leu lie Asn Gly Leu Leu Val Glu Glu Trp Gin His Ala 
130 135 140 

Ala Ala Ala Ala His Gly Ser Phe Arg His Asp Phe Pro Met Lys Arg 
145 150 155 160 

Leu Arg Cys Tyr Ser Asp Gly Gin Ser Cys 
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<210> 47 
<211> 974 
<212> DMA 

<213> Arabidopsis thaliana 
<220> 

<221> CDS 

<222> (62) . . (874) 

<223> G1073 

<400> 47 

ccccccgacc tgcctctaca gagacctgaa gattccagaa ccccacctga tcaaaaataa 60 

c atg gaa ctt aac aga tct gaa gca gac gaa gca aag gcc gag acc act 109 
Met Glu Leu Asn Arg Ser Glu Ala Asp Glu Ala Lys Ala Glu Thr Thr 
15 10 15 

ccc acc ggt gga gcc acc age tea gcc aca gcc tct ggc tct tec tec 157 
Pro Thr Gly Gly Ala Thr Ser Ser Ala Thr Ala Ser Gly Ser Ser Ser 
20 25 30 

gga cgt cgt cca cgt ggt cgt cct gca ggt tec aaa aac aaa ccc aaa 205 
Gly Arg Arg Pro Arg Gly Arg Pro Ala Gly Ser Lys Asn Lys Pro Lys 
35 40 45 

cct ccg acg att ata act aga gat agt cct aac gtc ctt aga tea cac 253 
Pro Pro Thr lie lie Thr Arg Asp Ser Pro Asn Val Leu Arg Ser His 
50 55 60 

gtt ctt gaa gtc acc tec ggt teg gac ata tec gag gca gtc tec acc 301 
Val Leu Glu Val Thr Ser Gly Ser Asp lie Ser Glu Ala Val Ser Thr 
65 70 75 80 

tac gcc act cgt cgc ggc tgc ggc gtt tgc att ata age ggc acg ggt 349 
Tyr Ala Thr Arg Arg Gly Cys Gly Val Cys lie lie Ser Gly Thr Gly 

85 90 95 

gcg gtc act aac gtc acg ata egg caa cct gcg get ccg get ggt gga 397 
Ala Val Thr Asn Val Thr He Arg Gin Pro Ala Ala Pro Ala Gly Gly 
100 105 110 

ggt gtg att acc ctg cat ggt egg ttt gac att ttg tct ttg acc ggt 445 
Gly Val He Thr Leu His Gly Arg Phe Asp He Leu Ser Leu Thr Gly 
115 120 125 

act gcg ctt cca^ccg cct gca cca ccg gga gca gga ggt ttg acg gtg 4 93 

Thr Ala^'Leu Pro Pro Pro Ala Pro Pro Gly Ala Gly Gly Leu Thr Val 
130 135 140 

tat eta gcc gga ggt caa gga caa gtt gta gga ggg aat gtg get ggt 541 
Tyr Leu Ala Gly Gly Gin Gly Gin Val Val Gly Gly Asn Val Ala Gly 
145 150 155 160 

teg tta att get teg gga ccg gta gtg ttg atg get get tct ttt gca 589 
Ser Leu He Ala Ser Gly Pro Val Val Leu Met Ala Ala Ser Phe Ala 

165 170 175 

aac gca gtt tat gat agg tta ccg att gaa gag gaa gaa acc cca ccg 637 
Asn Ala Val Tyr Asp Arg Leu Pro He Glu Glu Glu Glu Thr Pro Pro 

Page 54 



WO 01/26459 



PCT/US00/28141 



MBI-0021.txt 
180 185 190 

ccg aga acc acc ggg gtg cag cag cag cag ccg gag gcg tct cag teg 68 5 

Pro Arg Thr Thr Gly Val Gin Gin Gin Gin Pro Glu Ala Ser Gin Ser 
195 200 205 

teg gag gtt acg ggg agt ggg gee cag gcg tgt gag tea aac etc caa 733 
Ser Glu Val Thr Gly Ser Gly Ala Gin Ala Cys Glu Ser Asn Leu Gin 
210 215 220 

ggt gga aat ggt gga gga ggt gtt get ttc tac aat ctt gga atg aat 781 
Gly Gly Asn Gly Gly Gly Gly Val Ala Phe Tyr Asn Leu Gly Met Asn 
225 230 235 240 

atg aac aat ttt caa ttc tec ggg gga gat att tac ggt atg age ggc 829 
Met Asn Asn Phe Gin Phe Ser Gly Gly Asp lie Tyr Gly Met Ser Gly 
; 245 250 255 

ggt age gga gga ggt ggt ggc ggt gcg act aga ccc gcg ttt tag 874 
Gly Ser Gly Gly Gly Gly Gly Gly Ala Thr Arg Pro Ala Phe 
260 265 270 

agttttagcg ttttggtgac accttttgtt gcgtttgcgt gtttgacctc aaactactag 934 

gctactagct atagcggttg egaaatgega atattaggtt 974 

<210> 48 

<211> 270 

<212> PRT 

<213> Arabidopsis thaliana 

<400> 48 

Met Glu Leu Asn Arg Ser Glu Ala Asp Glu Ala Lys Ala Glu Thr Thr 
1 5 10 15 

Pro Thr Gly Gly Ala Thr Ser Ser Ala Thr Ala Ser Gly Ser Ser Ser 
20 25 30 

Gly Arg Arg Pro Arg Gly Arg Pro Ala Gly Ser Lys Asn Lys Pro Lys 
35 40 45 

Pro Pro Thr lie lie Thr Arg Asp Ser Pro Asn Val Leu Arg Ser His 
50 55 60 

Val Leu Glu Val Thr Ser Gly Ser Asp lie Ser Glu Ala Val Ser Thr 
65 70 75 80 

Tyr Ala Thr Arg Arg Gly Cys Gly Val Cys He He Ser Gly Thr Gly 

85 90 95 

Ala Val Thr Asn Val Thr He Arg Gin Pro Ala Ala Pro Ala Gly Gly 
100 105 110 



Gly Val He Thr Leu His Gly Arg Phe Asp He Leu Ser Leu Thr Gly 
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Thr Ala Leu Pro Pro Pro Ala Pro Pro Gly Ala Gly Gly Leu Thr Val 
130 135 140 



Tyr Leu Ala Gly Gly Gin Gly Gin Val Val Gly Gly Asn Val Ala Gly 
145 150 155 160 

Ser Leu lie Ala Ser Gly Pro Val Val Leu Met Ala Ala Ser Phe Ala 

165 170 175 



Asn Ala Val Tyr Asp Arg. Leu Pro lie Glu Glu Glu Glu Thr Pro Pro 
180 185 190 



Pro Arg Thr Thr Gly Val Gin Gin Gin Gin Pro Glu Ala Ser Gin Ser 
195 200 205 



Ser Glu Val Thr Gly Ser Gly Ala Gin Ala Cys Glu Ser Asn Leu Gin 
210 215 220 



Gly Gly Asn Gly Gly Gly Gly Val Ala Phe Tyr Asn Leu Gly Met Asn 
225 230 235 240 



Met Asn Asn Phe Gin Phe Ser Gly Gly Asp lie Tyr Gly Met Ser Gly 

245 250 255 



Gly Ser Gly Gly Gly Gly Gly Gly Ala Thr Arg Pro Ala Phe 
260 265 270 



<210> 


49 


<211> 


1281 


<212> 


DNA 


<213> 


Arabi clops is 


<220> 




<221> 


CDS 


<222> 


(8) . . (904) 


<223> 


G1435 


<400> 


49 



gtgaaac atg ggg aag gaa gtt atg gtg age gat tac ggt gac gac gac 4 9 

Met Gly Lys Glu Val Met Val Ser Asp Tyr Gly Asp Asp Asp 
15 10 

gga gaa gac gec ggc ggc ggc gat gaa tat agg att ccg gaa tgg gaa 97 
Gly Glu Asp Ala Gly Gly Gly Asp Glu Tyr Arg He Pro Glu Trp Glu 
15 20 25 30 

att ggt tta ccc aac gga gat gat ttg act ccg tta tct caa tat eta 145 
He Gly Leu Pro Asn Gly Asp Asp Leu Thr Pro Leu Ser Gin Tyr Leu 

35 40 45 
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gtc ccg teg att etc gcg tta get ttc age atg ate cca gaa cga age 193 

Val Pro Ser lie Leu Ala Leu Ala Phe Ser Met lie Pro Glu Arg Ser ] 
50 55 60 ; 

■ I 

cgt aca att cac gac gtc aat cgc gcg teg caa ate acg etc tct teg 241 ] 

Arg Thr lie His Asp Val Asn Arg Ala Ser Gin lie Thr Leu Ser Ser t 
65 70 75 

ttg aga age agt acc aat get teg tct gtg atg gag gag gtc gtg gat 289 
Leu Arg Ser Ser Thr Asn Ala Ser Ser Val Met Glu Glu Val Val Asp 
80 85 90 

cga gtt gaa teg agt gtt cca gga tea gat ccg aag aaa cag aag aaa 337 
Arg Val Glu Ser Ser Val Pro Gly Ser Asp Pro Lys Lys Gin Lys Lys 
95 100 105 110 

teg gat ggt ggt gaa'gca gcg gcg gtg gag gat tec acg gcg gag gaa 385 
Ser Asp Gly Gly Glu Ala Ala Ala Val Glu Asp Ser Thr Ala Glu Glu 

115 120 125 

gga gac tec ggg cct gaa gac gcg tct ggg aag aca teg aaa cga ccg 433 
Gly Asp Ser Gly Pro Glu Asp Ala Ser Gly Lys Thr Ser Lys Arg Pro 
130 135 140 

cgt tta gtg tgg aca ccg cag eta cac aag aga ttt gtg gac gtt gtg 481 
Arg Leu Val Trp Thr Pro Gin Leu His Lys Arg Phe Val Asp Val Val 
145 150 155 

get cat eta ggg att aaa aac gca gtg ccg aag acg att atg cag ctg 529 
Ala His Leu Gly lie Lys Asn Ala Val Pro Lys Thr lie Met Gin Leu 
160 165 170 

atg aac gtg gaa gga ctt act cgt gag aac gtt gcg tct cat ttg cag 577 
Met Asn Val Glu Gly Leu Thr Arg Glu Asn Val Ala Ser His Leu Gin 
175 180 185 190 

aaa tat agg ctt tac ctt aaa egg att caa gga ttg acg acg gaa gaa 625 
Lys Tyr Arg Leu Tyr Leu Lys Arg lie Gin Gly Leu Thr Thr Glu Glu 

195 200 205 

gat cct tat teg teg teg gat cag etc ttc tct tea acg ccg gtt cct 673 
Asp Pro Tyr Ser Ser Ser Asp Gin Leu Phe Ser Ser Thr Pro Val Pro 
210 215 220 

cca cag age ttt caa gac ggc gga gga agt aac gga aag ttg ggg gtt 721 
Pro Gin Ser Phe Gin Asp Gly Gly Gly Ser Asn Gly Lys Leu Gly Val 
225 230 235 

ccg gtt ccg gtt ccg teg atg gtg cct att cca ggc tat ggg aat caa 769 
Pro Val Pro Val Pro Ser Met Val Pro lie Pro Gly Tyr Gly Asn Gin 
240 245 250 

atg ggt atg caa gga tat tat caa cag tat agt aac cat ggc aat gaa 817 
Met Gly Met Gin Gly Tyr Tyr Gin Gin Tyr Ser Asn His Gly Asn Glu 
255 260 265 270 

tea aac caa tat atg atg cag cag aat aag ttt gga aca atg gtg aca 865 
Ser Asn Gin Tyr Met Met Gin Gin Asn Lys Phe Gly Thr Met Val Thr 

275 280 285 

tat cct tct gtt ggt ggt ggt gac gtg aat gac aag taa atggatctta 914 
Tyr Pro Ser Val Gly Gly Gly Asp Val Ash Asp Lys 
290 295 
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aaggtctata atttgctcta cagagagata ctggttcttg gcttatggtt tattttccca 974 

cttcatgagg ttgttgtgac ttttaattct ccatgttttc cacacaagtc tttattgcct 1034 

ttgtatagaa aatgatttcg agaaaatcac tgggaagctt ggtattgttg gaggatgaag 1094 

ccttctatga atgatttagt ttcctactgt ctccattctt tatgaggtaa taaagccttc 1154 

ttttgctcat cgcttgtagt cttcttaaat tcaagacagc gtcacatgtt tgttcggtta 1214 

tgttaattgt ttctttcttt ggataatgaa gatagcatca ggtctcatgt ctcctcactt 1274 

tgataaa 1281 

<210> 50 .. 

<211> 298 

<212> PRT 

<213> Arabidopsis thaliana 

<400> 50 

Met Gly Lys Glu Val Met Val Ser Asp Tyr Gly Asp Asp Asp Gly Glu 
1 5 10 15 

Asp Ala Gly Gly Gly Asp Glu Tyr Arg lie Pro Glu Trp Glu lie Gly 
20 25 30 

Leu Pro Asn Gly Asp Asp Leu Thr Pro Leu Ser Gin Tyr Leu Val Pro 
35 40 45 

Ser lie Leu Ala Leu Ala Phe Ser Met lie Pro Glu Arg Ser Arg Thr 
50 55 60 

lie His Asp Val Asn Arg Ala Ser Gin lie Thr Leu Ser Ser Leu Arg 
65 70 75 80 

Ser Ser Thr Asn Ala Ser Ser Val Met Glu Glu Val Val Asp Arg Val 

85 90 95 

» 

Glu Ser Ser Val Pro Gly Ser Asp Pro Lys Lys Gin Lys Lys Ser Asp 
100 105 110 

Gly Gly Glu Ala Ala Ala Val Glu Asp Ser Thr Ala Glu Glu Gly Asp 
115 120 125 

Ser Gly Pro Glu Asp Ala Ser Gly Lys Thr Ser Lys Arg Pro Arg Leu 
130 135 140 

Val Trp Thr Pro Gin Leu His Lys Arg Phe Val Asp Val Val Ala His 
145 150 155 160 
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Leu Gly lie Lys Asn Ala Val Pro Lys Thr lie Met Gin Leu Met Asn 

165 170 175 



Val Glu Gly Leu Thr Arg Glu Asn Val Ala Ser His Leu Gin Lys -Tyr 
180 185 190 



Arg Leu Tyr Leu Lys Arg lie Gin Gly Leu Thr Thr Glu Glu Asp Pro 
195 200 205 



Tyr Ser Ser Ser Asp Gin Leu Phe Ser Ser Thr Pro Val Pro Pro Gin 

210 215 220 

Ser Phe Gin Asp Gly Gly Gly Ser Asn Gly Lys Leu Gly Val Pro Val 

225 230 235 240 



Pro Val Pro Ser Met Val Pro lie Pro Gly Tyr Gly Asn Gin Met Gly 

245 250 255 

Met Gin Gly Tyr Tyr Gin Gin Tyr Ser Asn His Gly Asn Glu Ser Asn 

260 265 270 

Gin Tyr Met Met Gin Gin Asn Lys Phe Gly Thr Met Val Thr Tyr Pro 
275 280 285 



Ser Val Gly Gly Gly Asp Val Asn Asp Lys 
290 295 



<210> 


51 


<211> 


837 


<212> 


DNA 


<213> 


Arabidopsis 


<220> 




<221> 


CDS 


<222> 


(54) . . (629) 


<223> 


G180 


<400> 


51 



gtaattacga tctacaacaa gtgacatcgt cgtcgacgac gattcaagag aat atg 56 

Met 
1 



aac ttc etc gtt cct ttt gaa gaa acc aat gtc tta acc ttt ttc tct 
Asn Phe Leu Val Pro Phe Glu Glu Thr Asn Val Leu Thr Phe Phe Ser 
5 10 15 



104 



tct tct tct tec tct tct ctt tct tct cct tct ttc ccc att cac aac 152 
Ser Ser Ser Ser Ser Ser Leu Ser Ser Pro Ser Phe Pro lie His Asn 
20 25 30 

tct tec tec act act act act cat gca cct eta ggg ttt tct aat aat 200 
Ser Ser Ser Thr Thr Thr Thr His Ala Pro Leu Gly Phe Ser Asn Asn 
35 40 45 
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ctt cag ggt gga gga ccc ttg gga tea aag gtg gtt aat gat gat cag 248 
Leu Gin Gly Gly Gly Pro Leu Gly Ser Lys Val Val Asn Asp Asp Gin 
50 55 60 65 

gag aat ttt gga ggt gga act aac aat gat get cat tct aat tct tgg 296 
Glu Asn Phe Gly Gly Gly Thr Asn Asn Asp Ala His Ser Asn Ser Trp 

70 75 80 

tgg aga tea aat agt gga agt gga gat atg aag aac aaa gtg aag ata 344 
Trp Arg Ser Asn Ser Gly Ser Gly Asp Met Lys Asn Lys Val Lys lie 
85 90 95 

agg agg aaa eta aga gag cca aga ttc tgt ttc caa ace aaa age gat 392 
Arg Arg Lys Leu Arg Glu Pro Arg Phe Cys Phe Gin Thr Lys Ser Asp 

100 ■ ; .105 110 

gtt gat gtt ctt gac gat ggc tac aaa tgg cgt aaa tat ggt cag aaa 44 0 

Val Asp Val Leu Asp Asp Gly Tyr Lys Trp Arg Lys Tyr Gly Gin Lys 
115 120 125 

gtc gtc aag aac age ctt cac ccc agg agt tat tac aga tgc aca cac 488 
Val Val Lys Asn Ser Leu His Pro Arg Ser Tyr Tyr Arg Cys Thr His 
130 135 140 145 

aac aac tgt agg gtg aaa aag aga gtg gag cga eta teg gaa- gat tgt 536 
Asn Asn Cys Arg Val Lys Lys Arg Val Glu Arg Leu Ser Glu Asp Cys 

150 155 160 

aga atg gtg att act act tac gaa ggt cgt cac aac cac att ccc tct 584 
Arg Met Val lie Thr Thr Tyr Glu Gly Arg His Asn His lie Pro Ser 
165 170 175 

gat gac tec act tct cct gac cat gat tgt etc tct tec ttt taa 629 
Asp Asp Ser Thr Ser Pro Asp His Asp Cys Leu Ser Ser Phe 
180 185 190 

catctctttc tatatatcta tatatagaca gttatatgtg cacatataga tgtgtgatat 689 

attgeatatt tgatattgea tgtgtttttc aagagtatgt catcagatgt tatgeatata 74 9 

ttcttgactt gttgettata gtatacatat gtaataatat atattgacat tggtagttca 809 

tttctgttca aacaaaaaaa aaaaaaaa 837 

<210> 52 
<211> 191 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 52 

Met Asn Phe Leu Val Pro Phe Glu Glu Thr Asn Val Leu Thr Phe Phe 
15 10 15 

Ser Ser Ser Ser Ser Ser Ser Leu Ser Ser Pro Ser Phe Pro lie His 
20 25 30 



Asn Ser Ser Ser Thr Thr Thr Thr His Ala Pro Leu Gly Phe Ser Asn 
35 40 45 
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Asn Leu Gin Gly Gly Gly Pro Leu Gly Ser Lys Val Val Asn Asp Asp 
50 55 60 



Gin Glu Asn Phe Gly Gly Gly Thr Asn Asn Asp Ala His Ser Asn Ser 
65 70 75 80 



Trp Trp Arg Ser Asn Ser Gly Ser Gly Asp Met Lys Asn Lys Val Lys 

85 90 95 



lie Arg Arg Lys Leu Arg Glu Pro Arg Phe Cys Phe Gin Thr Lys Ser 
100 105 110 



Asp Val Asp Val Leu Asp Asp Gly Tyr Lys Trp Arg Lys Tyr Gly Gin 
115 . 120 125 



Lys Val Val Lys Asn Ser Leu His Pro Arg Ser Tyr Tyr Arg Cys Thr 
130 135 140 



His Asn Asn Cys Arg Val Lys Lys Arg Val Glu Arg Leu Ser Glu Asp 
145 150 155 160 



Cys Arg Met Val lie Thr Thr Tyr Glu Gly Arg His Asn His lie Pro 

165 170 175 



Ser Asp Asp Ser Thr Ser Pro Asp His Asp Cys Leu Ser Ser Phe 
180 185 190 



<210> 


53 


<211> 


1413 


<212> 


DNA 


<213> 


Arabidopsis thaliana 


<220> 




<221> 


CDS 


<222> 


(121) . . (1200) 


<223> 


G592 



<400> 53 

aagctattaa gatttggttt tctacaaatt tgttcttcct gaaacgtcac gagacagagc 60 

ttacaagaag agaaaacaga ggaaatttcg ttgcattttt tttacatatt gattcgatta 120 

atg gat tea aat aat cat etc tac gac ccg aat ccc acc ggg teg ggt 168 
Met Asp Ser Asn Asn His Leu Tyr Asp Pro Asn Pro Thr Gly Ser Gly 
1 5 . 10 15 

ctt ctt cgt ttt aga tea get ccg age tct gtt etc gee get ttt gtt 216 
Leu Leu Arg Phe Arg Ser Ala Pro Ser Ser Val Leu Ala Ala Phe Val 
20 25 30 

gac gac gac aag att ggt ttc gac tec gat agg ttg ctt tea aga ttc 264 
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Asp Asp Asp Lys lie Gly Phe Asp Ser Asp Arg Leu Leu Ser Arg Phe 
35 40 45 

gtg acc tct aat ggc gtt aac gga gat ctg ggt tea cct aaa ttc gag 312 
Val Thr Ser Asn Gly Val Asn Gly Asp Leu Gly Ser Pro Lys Phe Glu 
50 55 60 

gat aag tct ccg gtt teg tta acg aac acc tct gtt tea tac gec gec 360 
Asp Lys Ser Pro Val Ser Leu Thr Asn Thr Ser Val Ser Tyr Ala Ala 
65 70 75 80 

act ctg ccg cca ccg ccg cag ctt gag ccg teg agt ttt ctg ggt ttg 408 
Thr Leu Pro Pro Pro Pro Gin Leu Glu Pro Ser Ser Phe Leu Gly Leu 

85 90 95 

ccg ccg cat tac ccg agg cag agt aaa ggg ata atg aac teg gtt ggt 456 
Pro Pro His Tyr Pro Arg Gin Ser Lys Gly lie Met Asn Ser Val Gly 
100 105 110 

ttg gat cag ttt etc ggt ate aat aat cat cac acc aaa cca gtt gaa 504 
Leu Asp Gin Phe Leu Gly lie Asn Asn His His Thr Lys Pro Val Glu 
115 120 125 

tct aat ctt etc cgt caa age age tct cca gec gga atg ttt act aat 552 
Ser Asn Leu Leu Arg Gin Ser Ser Ser Pro Ala Gly Met Phe Thr Asn 
130 135 140 

etc tct gac caa aac ggt tat ggt tea atg agg aat ttg atg aat tac 600 
Leu Ser Asp Gin Asn Gly Tyr Gly Ser Met Arg Asn Leu Met Asn Tyr 
145 150 155 160 

gaa gaa gat gaa gag agt cca tct aat tec aat gga tta aga cgc cat 64 8 

Glu Glu Asp Glu Glu Ser Pro Ser Asn Ser Asn Gly Leu Arg Arg His 

165 170 175 

tgc agt etc tct tea agg cca cct tct tea ctt gga atg ctt tct caa 696 
Cys Ser Leu Ser Ser Arg Pro Pro Ser Ser Leu Gly Met Leu Ser Gin 
180 185 190 

ata cct gaa ate gca ccc gaa act aat ttt cca tat age cat tgg aat 744 
lie Pro Glu lie Ala Pro Glu Thr Asn Phe Pro Tyr Ser His Trp Asn 
195 200 205 

gat cca tec age ttt att gat aac tta tec tea ctt aaa aga gaa gee 792 
Asp Pro Ser Ser Phe lie Asp Asn Leu Ser Ser Leu Lys Arg Glu Ala 
210 215 220 

gag gac gat gga aaa ttg ttt etc gga get cag aac gga gag tec ggg 84 0 

Glu Asp Asp Gly Lys Leu Phe Leu Gly Ala Gin Asn Gly Glu Ser Gly 
225 230 235 240 

aat cgt atg cag tta ctg teg cat cat ttg age eta cca aag tea tea 888 
Asn Arg Met Gin Leu Leu Ser His His Leu Ser Leu Pro Lys Ser Ser 

245 250 255 

teg aca gec teg gac atg gtt tea gtg gat aag tat ctt cag eta caa 936 
Ser Thr Ala Ser Asp Met' Val Ser Val Asp Lys Tyr Leu Gin Leu Gin 
260 265 270 

gat tct gtt cct tgt aaa ate aga gee aaa cgt ggt tgc get aca cat 984 
Asp Ser Val Pro Cys Lys lie Arg Ala Lys Arg Gly Cys Ala Thr His 
275 280 285 
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cct cga age ate get gaa egg gta aga aga acg egg .ata age gag cga 1032 
Pro Arg Ser lie Ala Glu Arg Val Arg Arg Thr Arg lie Ser Glu Arg 
290 295 300 

atg agg aag tta caa gag ctt gtt cct aac atg gac aag caa acc aac 1080 
Met Arg Lys Leu Gin Glu Leu Val Pro Asn Met Asp Lys Gin Thr Asn 
305 310 315 320 

act teg gat atg ttg gat tta get gtg gat tac ate aaa gat tta caa 112 8 
Thr Ser Asp Met Leu Asp Leu Ala Val Asp Tyr lie Lys Asp Leu Gin 

325 330 335 

aga cag tat aag att tta aac gac aac aga get aac tgt aag tgt atg 1176 
Arg Gin Tyr Lys lie Leu Asn Asp Asn Arg Ala Asn Cys Lys Cys Met 
340 345 350 

aac aag gag aag aag tea ata tag ggcgcaacaa agtgtgtagt agataggact . 12 30 
Asn Lys Glu Lys Lys Ser lie 
355 

aaaaagcagg gagaaggaca agaaagaaac aatgtcatgt ctgaatattt tttagccgaa 12 90 

acagaccaaa ttgtctatgt aagctctcga gaaaagcatc tgcttccaac aaaattctaa 1350 

gtaataaaat agtactcgat ttgttcttat ttcattatta caatgeagaa tctactaatc 1410 
aaa 1413 



<210> 


54 


<211> 


359 


<212> 


PRT 


<213> 


Arabidopsis thaliana 


<400> 


54 


Met Asp Ser Asn Asn His Leu 


1 


5 



10 15 

Leu Leu Arg Phe Arg Ser Ala Pro Ser Ser Val Leu Ala Ala Phe Val 
20 25 30 

Asp Asp Asp Lys lie Gly Phe Asp Ser Asp Arg Leu Leu Ser Arg Phe 
35 40 45 

Val Thr Ser Asn Gly Val Asn Gly Asp Leu Gly Ser Pro Lys Phe Glu 
50 55 60 

Asp Lys Ser Pro Val Ser Leu Thr Asn Thr Ser Val Ser Tyr Ala Ala 
65 70 75 80 

Thr Leu Pro Pro Pro Pro Gin Leu Glu Pro Ser Ser Phe Leu Gly Leu 

85 90 95 

Pro Pro His Tyr Pro Arg Gin Ser Lys Gly He Met Asn Ser Val Gly 
100 105 110 
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Leu Asp Gin Phe Leu Gly lie Asn Asn His His Thr Lys Pro Val Glu 
115 120 125 



Ser Asn Leu Leu Arg Gin Ser Ser Ser Pro Ala Gly Met Phe Thr Asn 
130 135 140 



Leu Ser Asp Gin Asn Gly Tyr Gly Ser Met Arg Asn Leu Met Asn Tyr 
145 150 155 160 



Glu Glu Asp Glu Glu Ser Pro Ser Asn Ser Asn Gly Leu Arg Arg His 

165 170 175 



Cys Ser Leu Ser Ser Arg Pro Pro Ser Ser Leu Gly Met Leu Ser Gin 
180 185 190 



lie Pro Glu lie Ala Pro Glu Thr Asn Phe Pro Tyr Ser His Trp Asn 
195 200 205 



Asp Pro Ser Ser Phe lie Asp Asn Leu Ser Ser Leu Lys Arg Glu Ala 
210 215 220 



Glu Asp Asp Gly Lys Leu Phe Leu Gly Ala Gin Asn Gly Glu Ser Gly 
225 230 235 240 



Asn Arg Met Gin Leu Leu Ser His His Leu Ser Leu Pro Lys Ser Ser 

245 250 255 



Ser Thr Ala Ser Asp Met Val Ser Val Asp Lys Tyr Leu Gin Leu Gin 
260 265 270 



Asp Ser Val Pro Cys Lys lie Arg Ala Lys Arg Gly Cys Ala Thr His 
275 280 285 



Pro Arg Ser lie Ala Glu Arg Val Arg Arg Thr Arg lie Ser Glu Arg 
290 295 300 



Met Arg Lys Leu Gin Glu Leu Val Pro Asn Met Asp Lys Gin Thr Asn 
305 310 315 320 



Thr Ser Asp Met Leu Asp Leu Ala Val Asp Tyr lie Lys Asp Leu Gin 

325 330 335 



Arg Gin Tyr Lys lie Leu Asn Asp Asn Arg Ala Asn Cys Lys Cys Met 
340 345 350 



Asn Lys Glu Lys Lys Ser lie 
355 
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<210> 


55 


<211> 


764 


<212> 


DNA 


<213> 


Arabidopsis 


<220> 




<221> 


CDS 


<222> 


(15) . . (725) 


<223> 


G208 


<400> 


55 



cttcttcatt cacc atg gga aga tct cct tgt tgt gaa aaa get cac aca 50 

Met Gly Arg Ser Pro Cys Cys Glu Lys Ala His Thr 

; 1 ' ' ' 5 10 

aac aaa gga get tgg act aaa gaa gaa gat caa cgt etc gta gat tat 98 
Asn Lys Gly Ala Trp Thr Lys Glu Glu Asp Gin Arg Leu Val Asp Tyr 
15' 20 25 

ate cgt aat cac ggt gaa ggt tgt tgg cgt tct ctt cct aaa tec get 14 6 

lie Arg Asn His Gly Glu Gly Cys Trp Arg Ser Leu Pro Lys Ser Ala 
30 35 40 

gga ttg ttg cgt tgt ggt aaa agt tgt aga ttg aga tgg att aat tac 194 
Gly Leu Leu Arg Cys Gly Lys Ser Cys Arg Leu Arg Trp lie Asn Tyr 
45 50 55 60 

ctt cgt cct gat ctt aaa cgt ggt aat ttt act gat gat gaa gat caa 242 
Leu Arg* Pro Asp Leu Lys Arg Gly Asn Phe Thr Asp Asp Glu Asp Gin 

65 70 75 

ate ate ate aaa etc cat age tta etc ggt aac aaa tgg tea ttg ata 2 90 

lie lie lie Lys Leu His Ser Leu Leu Gly Asn Lys Trp Ser Leu lie 
80 85 90 

get gga aga tta cca gga aga aca gat aac gaa ata aag aat tat tgg 338 
Ala Gly Arg Leu Pro Gly Arg Thr Asp Asn Glu lie Lys Asn Tyr Trp 
95 100 105 

aac act cat att aag agg aag ctt ctt agt cac ggt att gat cca caa 386 
Asn Thr His lie Lys Arg Lys Leu Leu Ser His Gly lie Asp Pre Gin 
110 115 120 

act cat cgt cag att aac gaa tec aaa acg gtg teg tct caa gtt gtt 434 
Thr His Arg Gin lie Asn Glu Ser Lys Thr Val Ser Ser Gin Val Val 
125 130 135 140 

gtt cct att caa aac gat gee gtt gag tat tct ttt tec aat tta gec 4 82 

Val Pro lie Gin Asn Asp Ala Val Glu Tyr Ser Phe Ser Asn Leu Ala 

145 150 155 

gtt aaa ccg aag acg gaa aat tec tec gat aac gga get teg act age 530 
Val Lys Pro Lys Thr Glu Asn Ser Ser Asp Asn Gly Ala Ser Thr Ser 
160 165 170 

ggc acg acg acg gac gag gat etc egg cag aat ggg gag tgt tat tat 57 8 

Gly Thr Thr Thr Asp Glu Asp Leu Arg Gin Asn Gly Glu Cys Tyr Tyr 
175 180 185 

agt gat aat tea gga cat ata aag ctg aat ttg gat tta act ctt ggg 626 
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Ser Asp Asn Ser Gly His lie Lys Leu Asn Leu Asp Leu Thr Leu Gly 
190 195 200 

ttt gga tec tgg teg ggt egg ata gtc gga gtc ggg tea teg get gat 674 
Phe Gly Ser Trp Ser Gly Arg He Val Gly Val Gly Ser Ser Ala Asp 
205 210 215 220 

tct aaa ccg tgg tgc gac ccg gtg atg gag gcg cgt ttg tea ctg ttg 722 
Ser Lys Pro Trp Cys Asp Pro Val Met Glu Ala Arg Leu Ser Leu Leu 

225 230 235 

taa taatttgtca aaaaaatccc aaaaaatggg tttgttaaa 764 

<210> 56 

<211> 236 

<212> PRT : 

<213> Arabidopsis thai i ana 

<400> 56 

Met Gly Arg Ser Pro Cys Cys Glu Lys Ala His Thr Asn Lys Gly Ala 
1 5 . 10 15 

Trp Thr Lys Glu Glu Asp Gin Arg Leu Val Asp Tyr He Arg Asn His 
20 25 30 

Gly Glu Gly Cys Trp Arg Ser Leu Pro Lys Ser Ala Gly Leu Leu Arg 
35 40 45 

Cys Gly Lys Ser Cys Arg Leu Arg Trp He Asn Tyr Leu Arg Pro Asp 
50 55 60 

Leu Lys Arg Gly Asn Phe Thr Asp Asp Glu Asp Gin He He He Lys 
65 70 75 80 

Leu His Ser Leu Leu Gly Asn Lys Trp Ser Leu He Ala Gly Arg Leu 

85 90 95 

Pro Gly Arg Thr Asp Asn Glu He Lys Asn Tyr Trp Asn Thr His He 
100 105 110 

Lys Arg Lys Leu Leu Ser His Gly He Asp Pro Gin Thr His Arg Gin 
115 120 125 

He Asn Glu Ser Lys Thr Val Ser Ser Gin Val Val Val Pro He Gin 
130 135 140 

Asn Asp Ala Val Glu Tyr Ser Phe Ser Asn Leu Ala Val Lys Pro Lys 
145 150 155 160 

Thr Glu Asn Ser Ser Asp Asn Gly Ala Ser Thr Ser Gly Thr Thr Thr 

165 170 175 
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Asp Glu Asp 



Leu Arg Gin Asn Gly Glu Cys Tyr Tyr Ser Asp Asn Ser 
180 185 190 



Gly His He 
195 



Lys Leu Asn Leu Asp Leu Thr Leu Gly Phe Gly Ser Trp 

200 205 



Ser Gly Arg 
210 



He Val Gly Val Gly Ser Ser Ala Asp Ser Lys Pro Trp 
215 220 



Cys Asp Pro 
225 



Val Met Glu Ala Arg Leu Ser Leu Leu 
230 235 



<210> 57 

<211> 28 

<212> DNA 

<213> synthetic oligonucleotide 
<220> 

<221> misc_f eature 

<222> ()..() 

<223> 025908 



<400> 57 

ggcataaccc ttatcggaga tttgaagc 28 



<210> 


58 


<211> 


28 


<212> 


DNA 


<213> 


synthetic oligonucleotide 


<220> 




<221> 


misc feature 


<222> 


()..() 


<223> 


025910 


<400> 


58 


acacaaactc tgatcttgtc tccgaagg 


<210> 


59 


<211> 


30 


<212> 


DNA 


<213> 


synthetic oligonucleotide 


<220> 




<221> 


misc feature 


<222> 


()..<) 


<223> 


028990 



.28 



<400> 59 

gcataaccct tatcggagat ttgaagccat 
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<210> 60 

<211> 30 

<212> DNA 

<213> synthetic oligonucleotide 
<220> 

<22l> misc_feature 

<222> ()..{) 

<223> 028991 



<400> 60 

aacattcctc tctcatcatc tgttgccagc 



<210> 61 

<211> 28 

<212> DNA 

<213> synthetic oligonucleotide 
<220> 

<221> misc_f eature 

<222> ()..() 

<223> 025907 



<400> 61 

aacgcttagt atctccggcg acttgaac 



<210> 


62 


<211> 


28 


<212> 


DNA 


<213> 


synthetic oligonucleotide 


<220> 




<221> 


raise feature 


<222> 


()-.() 


<223> 


025909 


<400> 


62 


ctcacacgaa taaggtacaa agttcatc 


<210> 


63 


<211> 


30 


<212> 


DNA 


<213> 


synthetic oligonucleotide 


<400> 


63 


ttagtatctc cggcgacttg aacccaaacc 


<210> 


64 


<211> 


30 


<212> 


DNA 



<213> synthetic oligonucleotide 



<220> 

<221> misc_feature 

<222> ()..() 

<223> 028985 



MBI-0021.txt 



30 



28 



28 



30 
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<400> 64 

agattctcaa caagcttcaa catgagttcg 30 
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